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This  dissertation  presents  a new  design  approach  to 
knowledge-based  decision  support  systems  for  fault  diagnosis 
and  control  in  distributed  process  environments.  Based  on  the 
observed  manifestations,  the  knowledge-based  diagnostic  system 
hypothesizes  a set  of  the  most  plausible  disorders  by 
mimicking  the  reasoning  process  of  a human  diagnostician.  The 
data  integration  technique  is  designed  to  put  into  the 
knowledge  base  the  spreading  of  the  observable  information 
over  the  shopfloor,  to  interrelate  elements  based  on  the 
entity-category-relation  structure,  and  to  generate  error-free 
hierarchical  category  files.  A novel  approach  to  diagnostic 
problem  solving  has  been  proposed  by  integrating  the  PADIKS 
(Pattern-Directed  Knowledge-Based  System)  concept  developed  at 
the  Center  for  Information  Research  and  the  symbolic  model  of 
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diagnostic  reasoning  based  on  the  categorical  causal  model. 
The  combination  of  symbolic  causal  reasoning  and  pattern- 
directed  reasoning  produces  a highly  efficient  diagnostic 
procedure  and  generates  a more  realistic  expert  behavior.  In 
addition,  three  distinctive  constraints  are  designed  to 
further  reduce  the  computational  complexity  involved  in  the 
multiple  disorders  problem.  As  an  enhanced  version  of  the 
PADIKS  approach,  this  approach  fortifies  the  PADIKS ' s 
diagnostic  capability  by  adding  several  salient  features, 
i.e.,  (1)  multiple  disorders  identification  based  on  the 
categorical  causal  model,  (2)  constraint-based  disorder 
candidates  reduction,  and  (3)  pattern-directed  approach  to 
rank  the  disorder  candidates  on  the  basis  of  pattern  matching 
and  classification.  The  proposed  diagnostic  mechanism,  which 
consists  of  three  different  levels  of  reasoning  operations, 
significantly  reduces  the  computational  complexity  in  the 
diagnostic  problem  with  uncertainty  by  systematically 
shrinking  the  hypotheses  space.  This  approach  is  applied  to 
the  test  and  inspection  data  collected  from  a PCB 
manufacturing  operation. 
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CHAPTER  I 
INTRODUCTION 


1.1  Background:  Automation  Information  Integration 

It  is  becoming  increasingly  obvious  to  those  in 
manufacturing  industries  that  to  stay  competitive  in  the 
future  a high  degree  of  integrated  automation  will  be  required 
in  both  the  physical  production  and  in  the  information 
management  aspects  of  their  operation.  If  the  automation  in 
physical  production  is  installed  without  an  integrated  view  of 
the  information  management  problem,  the  result  will  be 
"islands  of  automation"  which  have  limited  flexibility  to 
change  because  of  their  limited  information-sharing 

capability.  Solving  the  information  dilemma  requires  a 
realization  that  information  is  a corporate  resource  which 
must  be  managed,  controlled,  integrated  and  utilized  much  like 
production  and  inventory  (Young  and  Mayer  [1984],  Tou 
[1985b]) . In  order  to  efficiently  manage  information  resources 
in  an  automated  manufacturing  environment,  we  need  to 
conceptualize  manufacturing  operations  as  distributed 

information  processes  and  then  to  interrelate  them  on  the 
basis  of  their  information  transaction  (Tou  and  Chung  [1990]). 

In  an  automated  manufacturing  environment,  there  is  a 
strong  need  to  fuse  and  integrate  data  and  information 
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spreading  over  the  factory  floor  in  order  to  improve  product 
quality  and  engineering  productivity.  As  shopfloors  have 
increased  in  speed,  complexity,  and  scope,  the  requirements 
imposed  on  data  collection  and  fusion,  information 
integration,  and  decision  support  techniques  for  productivity 
improvement  and  quality  assurance  have  exceeded  the  capability 
of  the  traditional  manual  techniques  (Waltz  and  Buede  [1986]). 
The  increase  of  information  flow  from  various  sources  and  the 
reduced  reaction  time  requirement  have  dictated  the 
intelligent  decision  support  as  well  as  the  automation  of  data 
collection  and  data  integration. 

In  a distributed  process  environment,  the  importance  of 
reliable  automated  fault  diagnosis  and  control  is  increasing 
rapidly  as  the  systems  become  more  complex  and  they  need  to 
operate  with  minimum  malfunctioning  or  breakdown  time 
(Tzafestas  [1989]).  For  example,  in  a chemical  plant,  the 
product  quality  is  maintained  by  assuring  that  process 
variables  fluctuate  within  permissible  ranges.  If  operating 
conditions  go  outside  these  ranges,  the  product  quality  is  not 
acceptable,  or  more  critically,  some  catastrophic  events  might 
result.  The  task  of  fault  diagnosis  and  process  supervisory 
control  is  very  difficult  for  the  human  operator;  even  well- 
trained  experts  have  difficulty  in  solving  unanticipated 
problems  or  rarely  occurring  faults.  The  reaction  time  is 
critical  in  these  circumstances.  Hesitation  or  improper  action 
could  lead  to  the  significant  deterioration  of  product  quality 
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or  to  a disaster.  Here  is  exactly  where  knowledge-based  fault 
diagnosis  and  control  can  offer  a unique  aid  to  the  operators 
for  detecting,  locating  and  identifying  process  malfunctions, 
and  for  applying  efficient  control  actions. 

The  knowledge-based  systems  are  a class  of  computer 
programs  that  emulate  the  problem-solving  skills  of  the  human 
expert  in  specialized  domains.  They  differ  from  conventional 
programs,  which  use  fixed  algorithms  for  manipulating  data,  in 
that  they  can  piece  together  a large  amount  of  fragmentary 
knowledge  in  the  facts  and  judgmental  rules  used  by  human 
experts  to  solve  a given  problem. 

The  design  and  application  of  knowledge-based  expert 
systems  for  fault  diagnosis  and  process  control  have  received 
a good  deal  of  attention  among  knowledge  engineers,  due  to  the 
resulting  improved  efficiency  and  effectiveness.  Process 
control  is  a knowledge-intensive  and  experience-based  task, 
which  in  complex  processes  can  sometimes  go  beyond  the 
capabilities  of  skilled  operators  and  engineers.  Knowledge- 
based  expert  systems,  we  believe,  can  provide  substantial 
assistance  for  decision  making  in  process  control  as  well  as 
for  real-time  process  supervision  (Tzafestas  [1989],  Stock 
[1989] ) . 

1.2  Distributed  Process  Environment 

In  order  to  define  the  scope  of  our  research  and  to 
discuss  our  application  domain,  we  introduce  the  concept  of 
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distributed  process  environment  from  the  general  point  of  view 
of  distributed  information  processing. 

A distributed  process  environment  can  be  considered  as  a 
composite  system  comprising  the  basic  components  depicted  in 
Figure  1.1,  i.e.,  a distributed  process,  a data  collection 

system,  a process  control  system,  and  operators. 

1.2.1  Distributed  Process 

The  distributed  process  consists  of  many  subprocesses  (or 
subsystems)  which  cooperate  to  perform  a pre-determined , 
specific  manufacturing  task  on  the  shopfloor.  Figure  1.2  shows 
the  configuration  of  subprocesses  involved  in  a typical  PCB 
manufacturing  operation  which  is  our  application  domain.  The 
distributed  process  is  characterized  by  several  features  that 
include  (Gallanti  and  Guida  [1985]): 

(1)  complexity  of  structure,  i.e.  it  is  made  up  of  many 
interacting  subprocesses  which  are  geographically 
separated ; 

(2)  complexity  of  description  of  its  subprocesses  and  their 
internal  operation,  often  requiring  sophisticated 
mathematical  models; 

(3)  complexity  and  variety  of  involved  physical  and 
engineering  models : electrical,  thermodynamical,  nuclear, 
mechanical,  etc.; 

(4)  speed  of  evolution,  that  often  imposes  real-time 
constraints  to  the  interacting  systems  in  order  to  ensure 
timely  intervention. 
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Figure  1.1  Schema  of  a Distributed  Process  Environment 
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Figure  1.2  Process  Configuration  of  a Typical  PCB  Manufacturing 
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1.2.2  Data  Collection  System 

The  data  collection  system  is  devoted  to  collecting  data 
from  a distributed  process  in  the  shopfloor  and  to  supplying 
them  to  the  process  control  system.  It  features  three  basic 
characteristics : 

(1)  the  large  number  of  data  collected  from  the  distributed 
process ; 

(2)  the  uncertainty  of  measurement  due  to  operators' 
mistakes,  spurious  readings,  sensor  degration,  etc.; 

(3)  the  possible  overlapping  and  conflict  between  different 
observations . 

Therefore,  one  of  the  main  problems  for  the  process 
control  system  and  the  operators  is  verification  and 
validation,  integration,  and  the  interpretation  of  various 
data  collected  from  the  distributed  process. 

The  data  collection  system  should  provide  a efficient 
data  channel  for  gathering  data  observed  or  measured  from 
various  subprocesses  in  a distributed  process.  A simple 
example  of  the  data  collection  system  is  illustrated  in  Figure 
1.3,  which  is  used  to  collect  test  data  and  inspection  data 
from  a PCB  manufacturing  operation.  In  the  example  of  Figure 
1.3,  SPn  denotes  the  nth  subprocess  in  the  distributed  process 
and  the  bar  code  readers  are  daisy-chained. 

1.2.3  Process  Control  System 

The  process  control  system  comprises  the  traditional 
systems  used  for  on-line  process  control,  also  including  off- 
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Figure  1.3  An  Example  of  Data  Collection  System 
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line  decision  support  tools  such  as  numerical  simulation 
programs.  In  a fully  automated  process  environment,  it  is 
responsible  for  generating  a well-structured  manifestation 
file  from  the  observation,  for  performing  the  diagnostic 
reasoning  based  on  the  observed  manifestations  and  the 
experts'  knowledge,  for  recommending  the  most  plausible 
disorders  for  operators,  and  for  suggesting  the  repair 
procedure  or  the  process  control  procedure  to  operators.  The 
process  control  system  also  provides  an  interactive  user 
interface  for  operators.  An  intelligent  decision  supporting 
system  for  the  process  control  performs  the  generation  of 
various  report  for  quality  assurance  as  well  as  the  diagnostic 
reasoning. 

1.2.4  Operators 

The  group  of  operators  comprises  several  categories  of 
persons  who  have  direct  responsibility  for  process 
surveillance  and  operation,  such  as  maintenance  staff,  quality 
control  engineers,  and  process  control  specialists.  Through 
the  process  control  system,  the  operators  monitor  and 
supervise  the  behavior  of  the  distributed  process.  When  the 
diagnosis  results  with  the  recommendation  for  control 
procedures  which  are  hypothesized  by  the  process  control 
system,  operators  usually  take  the  responsibility  of  making 
the  final  decision  on  any  changes  (modifications,  or  control) 
of  the  distributed  process.  As  the  distributed  process 
environment  becomes  more  computerized,  the  responsibilities 
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(functions)  of  operators  migrate  into  those  of  the  automatic 
process  control  system. 

1.3  Statement  of  Problem 

In  knowledge-based  expert  systems,  diagnosis  is  one  of 
the  largest  application  domains  (Milne  [1987])  . Strategies  and 
capabilities  for  diagnosis  have  been  evolving  rapidly.  Most  of 
the  past  applications  involving  diagnosis  have  been  rule 
based.  That  is,  they  use  simple  production  rules  to  provide  a 
mapping  between  the  possible  disorders  and  possible 
manifestations.  The  leading  wave  of  technology  has  provided 
various  new  techniques  that  are  applicable  to  diagnostic 
problem  solving.  These  techniques  give  us  some  ability  to 
build  and  reason  about  deep  models  and  involve  a wide  range  of 
information,  such  as  probabilistic  information  and  cause-and- 
effect  relationships.  Most  of  the  proposed  techniques, 
however,  are  very  domain-dependent  and  have  unreasonable 
assumptions  for  reducing  computational  complexity  so  that  they 
are  not  applicable  to  general  diagnostic  problems. 

In  this  dissertation,  we  focus  our  attention  on  the 
development  of  the  theoretic  framework  for  general  diagnostic 
problem  solving  which  is  applicable  in  a distributed  process 
environment.  The  techniques  developed  here  are  applied  to  test 
data  and  inspection  data  from  a PCB  manufacturing  operation  in 
order  to  hypothesize  the  most  plausible  disorders  for  helping 
the  operators  perform  process  control  for  quality  assurance. 
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1.4  Previous  Work 

Knowledge-based  expert  systems  have  been  developed  in 
fields  such  as  medical  diagnosis  (Tou  [1978],  Chang  and  Tou 
[1984]),  equipment  fault  diagnosis  (Bennett  and  Hollander 
[1981]),  computer  configuration  (McDermott  [1982]),  mineral 
exploration  (Duda,  Gaschnig,  and  Hart  [1979]),  chemical  data 
interpretation  (Lindsay,  Buchanan,  Feigenbaum,  and  Lederburg 
[1980]),  and  nuclear  power  plant  consultation  (Nelson  [1982]). 
With  real  application  of  these  systems,  great  interest  exists 
in  applying  knowledge-based  techniques  to  fault  diagnosis  and 
process  control  in  the  automated  manufacturing  environment 
(Biswas,  Abramczyk,  and  Oliff  [1978]).  During  the  last  several 
years  a growing  amount  of  research  has  been  invested  in 
developing  methods  for  diagnostic  expert  systems  that  model 
human  experts'  diagnostic  problem  solving  in  a distributed 
process  environment  such  as  computer  integrated  manufacturing 
(CIM) . The  potential  power  of  expert  systems  which  can 
replicate  and  autonomously  apply  expensive  human  expertise  has 
led  to  a worldwide  effort  to  explore  and  extend  this 
technology. 

The  systems  being  built  in  the  field  of  fault  diagnosis 
and  control  for  manufacturing  and  process  environment  include 
REACTOR  (Nelson  [1982]),  a rule-based  expert  system  for  both 
event-oriented  and  function-oriented  diagnosis  of  nuclear 
power  plant  accidents,  PDS  (Gonzalez  and  Lowenfeld  [1986]), 
which  addresses  the  problem  of  on-line,  real-time  diagnosis  of 
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malfunction  in  machine  processes,  PROP  (Gallanti  and  Guida 
[1985]),  an  expert  system  for  on-line  monitoring  of  the 
pollution  of  the  cycle  water  in  a thermal  power  plant,  PICON 
(Moore,  Hawkinson,  Knickerbocker,  and  Churchman  [1984]),  an 
expert  system  for  forward  and  backward  chaining  inference  in 
a distributed  process  control  system,  and  LES  (Laffey, 
Perkins,  and  Nguyen  [1984]),  an  goal-driven  and  backward 
chaining  expert  system  for  diagnosing  a large  signal-switching 
network.  Various  design  methodologies  have  been  proposed  such 
as  the  rule-based  approach,  procedural  systems,  frame-based 
approach,  pattern-directed,  etc.  (Barr  and  Feigenbaum  [1981], 
Tou  [1978]) . One  highly  publicized  technique  is  the  rule-based 
approach  which  contains  a rule  base  made  up  of  a collection  of 
production  rules  coupled  with  an  inference  mechanism 
navigating  through  the  rule  base  for  problem  solving.  However, 
as  Golden  et  al.  (Golden,  Siemens,  and  Ferguson  [1986])  and 
Milne  (Milne  [1985])  have  pointed  out,  there  are  several 
difficulties  embedded  in  this  design  concept,  such  as  a 
shortage  of  inference  methodology,  limited  generic  process, 
etc.  In  spite  of  the  invention  of  Rete  (Forgy  [1982]),  a high- 
speed matching  algorithm  which  gracefully  accelerates  the 
process  for  rule  seeking,  those  fundamental  obstacles  still 
remain  unsolved.  Moreover,  most  current  systems  assume  that 
operations  in  the  process  can  be  represented  efficiently  by 
the  quantitative  (or  qualitative)  model.  However,  the 
distributed  process  environment,  usually  consisting  of  many 
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heterogeneous  components,  is  difficult  to  model  in  a 
deterministic  way.  Therefore,  this  assumption  makes  those 
systems  domain-specific  and  impractical  for  the  general 
applications.  A new  approach,  we  believe,  should  be 
investigated  to  solve  diagnostic  problems  in  the  complex 
process  environment  by  efficiently  fusing  and  utilizing  the 
observable  data,  information,  and  knowledge,  instead  of  by 
modelling  the  whole  process. 

In  contrast  to  the  traditional  design  approach,  Tou 
introduced  another  type  of  design  concept  called  PADIKS, 
standing  for  the  Pattern-Directed  Knowledge-Based  System  (Tou 
[1978],  [1982],  [1985a],  [1985b]),  which  bases  its  design 
philosophy  upon  the  construction  of  a pyramid-like  know-how 
set  in  conjunction  with  pattern  recognition  techniques  for 
knowledge  seeking  and  diagnostic  inference.  This  approach  is 
especially  powerful  in  dealing  with  problems  with 
classif icatory  properties  such  as  medical  diagnosis  (Tou 
[1978],  Chang  and  Tou  [1984]),  agricultural  application  (Tou 
and  Cheng  [1983]),  and  information  telebrowsing  (Tou  and 
Depree  [1978]).  The  pattern-directed  approach,  with  the 
clustering  concept  embedded  in  its  inference  logic,  has  proved 
particularly  effective  in  categorical  type  reasoning.  Several 
intelligent  systems:  MEDIKS,  a medical  knowledge  system  for 
diagnostic  consultation  and  clinical  decision  making  (Tou 
[1978],  Chang  and  Tou  [1984]);  APRIKS,  a knowledge-based 
expert  system  for  application  in  agriculture  (Tou  and  Cheng 
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[1983]);  and  AVIS,  an  automatic  verification  system  for  CAD 
database  (Dai  [1989])  exemplify  this  benefit.  In  the 
diagnostic  problem  of  the  automated  manufacturing  operation, 
especially  of  PCB  manufacturing  operation,  we  have  also  found 
significant  taxonomic  properties  in  the  relationship  between 
defects  and  their  causes. 

In  this  dissertation,  we  draw  on  the  experience  from  the 
previous  systems,  MEDIKS,  APRIKS,  and  AVIS,  and  further  expand 
and  consolidate  the  pattern-directed  design  concept  into  a 
full-fledged  framework  adequate  for  the  intelligent  decision 
support  system  for  fault  diagnosis  and  supervisory  control  in 
a distributed  process  environment. 

1.5  Organization  of  Dissertation 

Chapter  II  discusses  the  nature  of  engineering  diagnosis 
for  fault  diagnosis  and  supervisory  process  control  in  a 
distributed  process  environment.  Three  phases  of  diagnostic 
reasoning  procedures  for  the  process  control  are  discussed, 
emphasizing  the  necessary  steps  of  each  phase.  The  concept  of 
PADIKS  approach  is  closely  examined  on  the  basis  of  its 
knowledge  representation  scheme  and  diagnostic  inference 
mechanism.  The  concept  of  entity-relation  model  is  presented 
and  then  the  entity-category-relation  model  is  discussed  with 
several  examples  to  show  its  applicability  to  general 
diagnostic  problem  solving.  Advantages  and  disadvantages  of 
both  the  symbolic  diagnostic  model  and  the  probabilistic 
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diagnostic  model  are  also  discussed  and  compared.  Based  on 
that  discussion,  we  contrast  the  effectiveness  and  superiority 
of  our  proposed  diagnostic  model,  i.e.  the  combination  of  both 
the  symbolic  model  and  the  probabilistic  model,  with  domain- 
independent,  constraint-based  heuristics. 

Chapter  III  presents  the  design  concept  of  data 
integration  technigues  to  generate  error-free  category  files 
from  occasionally  erroneous  data  gathered  at  various 
subprocesses.  The  data  integration  system  consists  of  four 
different  operations:  data  normalization,  data  verification, 
data  validation,  and  data  association.  The  data  integration 
techniques  are  designed  to  put  into  the  knowledge  base  the 
spreading  of  the  observable  information  over  the  shopfloor,  to 
interrelate  elements  based  on  the  entity-category-relation 
model,  and  to  generate  error-free  category  files.  The  proposed 
data  integration  techniques  are  applied  to  test  data  and 
inspection  data  collected  from  the  PCB  manufacturing  operation 
at  an  electronics  manufacturing  company.  The  feature  selection 
and  extraction  schemes  are  designed  to  provide  the  categorized 
(decomposed)  structure  of  the  observed  manifestations 
(features)  for  facilitating  the  next  stage  of  diagnostic 
reasoning  procedures. 

In  Chapter  IV,  a novel  approach  to  diagnostic  problem 
solving  is  proposed  by  integrating  the  PADIKS  concept  and  the 
symbolic  model  of  diagnostic  reasoning  based  on  the 
categorical  causal  network  model.  Three  different  levels  of 
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reasoning  operations,  i.e.,  the  model-based  candidate 
generator,  the  constraint-based  filter,  and  the  pattern- 
directed  ranker,  are  discussed  respectively.  Combining  with 
the  set  covering  theory,  we  present  the  formal  description  of 
the  categorical  causal  model,  emphasizing  its  capability  of 
reducing  the  computational  complexity  involved  in  diagnostic 
reasoning.  The  design  concept  of  the  constraint-based  filter 
is  also  discussed  on  the  basis  of  three  important  constraints, 
namely,  the  expected  features  constraint,  the  disorder 
dependence  constraint,  and  the  cardinality  constraint. 
Especially,  the  feature  space  partitioning  concept  is 
discussed  in  detail.  We  also  develop  the  concept  of  logical 
cardinality  by  utilizing  the  relationships  between  the 
hypothesized  simultaneous  disorders.  Based  on  the  PADIKS 
approach,  the  pattern-directed  ranker  is  designed  to 
differentiate  the  disorder  candidates.  To  effectively  solve  a 
multiple  disorder  problem,  the  disorder  concepts  and  the 
strength  of  causal  association  are  newly  defined.  To  rank  the 
disorder  concepts  hypothesized  in  the  previous  diagnostic 
reasoning  operations,  the  likelihood  measure  is  designed  on 
the  basis  of  Bayesian  theory.  Some  examples  are  used  to 
illustrate  how  to  reduce  the  hypotheses  space  in  a real-world 
diagnostic  problem. 

After  the  exclusive  and  comprehensive  discussion  of  the 
design  concept  of  the  data  integration  and  the  multi-level 
diagnostic  reasoning  mechanism,  in  Chapter  V,  we  discuss  the 
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implementation  aspect  of  the  proposed  methodologies.  The 
DSS/DPC  (Knowledge-based  Decision  Support  System  for  Fault 
Diagnosis  and  Process  Control)  is  developed  to  perform  two 
primary  tasks,  i.e.,  the  data  integration  operation  and  the 
multi-level  diagnostic  reasoning  operation.  The  system 
architecture  and  the  development  environment  are  discussed.  By 
applying  the  proposed  techniques  to  test  data  and  inspection 
data  from  a PCB  manufacturing  operation,  some  experimental 
results  are  presented. 

Chapter  VI  summarizes  the  major  accomplishments  reported 
in  this  dissertation  and  provides  suggestions  for  further 
research. 

In  Appendix,  a new  reference  model  for  distributed 
process  environment  is  presented  as  an  architectural 
framework.  The  proposed  model  is  designed  upon  the  functional 
decomposition  and  integration  of  each  subprocess. 


CHAPTER  II 

KNOWLEDGE -BAS ED  APPROACH  TO  DIAGNOSIS 
2.1  The  Nature  of  Engineering  Diagnosis 

In  solving  a general  diagnostic  problem,  one  must  first 
observe  and  then  try  to  find  possible  disorders  by  proper 
reasoning.  The  reasoning  can  be  either  empirical  by  using 
accumulated  experience,  or  functional  by  using  knowledge  about 
system  components  and  organization.  Upon  the  diagnosis  result, 
one  reacts  with  a proper  treatment.  In  fact,  engineering 
diagnosis  problems,  such  as  troubleshooting  and  debugging, 
seem  to  confirm  to  this  pattern  of  observation-hypothesizing- 
action.  The  initial  step  involves  observation.  The  purpose  of 
this  step  is  to  find  as  many  manifestations  (symptoms)  as 
possible  to  help  find  the  most  plausible  disorders  (faults). 
Of  course,  we  usually  focus  on  the  manifestations  which  are 
directly  or  indirectly  relevant  to  the  problem  domain.  The 
next  step  is  to  perform  the  diagnostic  reasoning  based  on  the 
observed  manifestations  and  select  the  most  plausible 
disorders  from  a set  of  hypothesized  candidates.  The  last  step 
generates  the  control  directives  to  eliminate  the  causes  of 
defects  (or  manifestations)  and/or  to  remove  defects.  This 
pattern  can  also  be  found  in  medical  diagnosis  (Tou  [1978], 
Shortlif fe  [ 1983 ] ) . 
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2.2  Knowledge-Based  Diagnosis  for  Process  Control 

The  knowledge-based  approach  to  process  control  in  an 
automated  manufacturing  environment  is  to  mimic  human  experts ' 
diagnostic  problem  solving  logic  on  the  basis  of  observations 
collected  from  the  distributed  (manufacturing)  process.  In 
other  words,  the  diagnostic  problem  for  process  control  is 
basically  to  identify  a specific  disorder  (or  a set  of 
disorders)  which  causes  the  observed  manifestation (s)  on  the 
basis  of  both  the  well-organized  knowledge  base  and 
manifestations.  As  discussed  previously,  human  diagnostic 
process  for  problem  solving  might  be  considered  as  consisting 
of  three  phases  as  shown  in  Figure  2.1:  (1)  Observation  phase; 
(2)  Problem  identification  phase;  and  (3)  Control  generation 
phase . 

In  the  observation  phase,  we  mainly  collect 
manifestations  characterizing  an  individual  disorder  (class  or 
concept) . These  collected  manifestations  can  be  represented  by 
an  information  profile, 

{ Xj  | i = 1,2,3, ...,1} 

where  Xf  denotes  the  observed  features  (or  attributes) . And  an 
observation  may  be  described  by  an  information  pattern, 


X [ i ^2  , Xj  , . . • X^  ] 
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data  measurement 
data  collection 
data  integration 


feature  extraction 
pattern  matching/ inference 
cause  identification 


control  procedure 
generation 
outcome  evaluation 


Figure  2.1  Three  Phases  of  Diagnostic  Procedures 
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where  is  a value  of  feature  X;  for  a given  problem  category. 
For  instance,  if  X1  is  a binary  feature,  x,  may  be  1 or  0.  In 
the  problem  identification  phase,  we  determine  the  nature  of 
the  problem  from  the  observed  feature  value,  derive  an 
information  pattern  representing  this  problem,  and  compare 
that  with  pre-stored  information  patterns  to  yield  an  accurate 
solution.  Once  we  are  able  to  identify  a problem,  the 
methodologies  and  procedures  for  removing  the  causes  of  the 
problem  are  determined  in  the  control  generation  phase. 

This  problem  solving  process  requires  a set  of 
information  patterns  (attributes)  in  conjunction  with  the  pre- 
set semantic  structure  characterizing  its  problem  domain.  In 
real  world  problems,  especially  in  manufacturing  operations, 
the  observed  features  (defects,  or  symptoms)  are  not  well- 
organized  and  even  erroneous  due  to  various  reasons  such  as 
operators'  mistakes,  inaccurate  measurement  tools,  and 
different  scales  (or  units) . Therefore,  at  the  preprocessing 
stage,  the  data  integration  operation  should  be  performed  to 
generate  well-defined  information  patterns  (Tou  and  Chung 
[1990]).  By  performing  the  data  integration  operation,  we  can 
generate  the  well-structured  error-free  category  file  for  the 
observed  manifestations.  Based  on  this  category  file,  we  can 
extract  the  feature  vector  characterizing  the  current 
situation  in  the  manufacturing  operation.  Therefore,  the  data 
collection  and  data  integration  operation  are  required  to 
perform  the  observation  phase  of  the  diagnostic  process  and  to 
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provide  an  efficient  data  structure  for  the  feature  selection 
and  extraction  of  the  next  phase  of  the  problem 
identification. 

In  the  problem  identification  phase,  we  can  find  the  most 
plausible  defect  cause  (defect  causes)  from  the  result  of  the 
feature  extraction  by  performing  pattern  matching  and 
classification  operations,  on  the  basis  of  the  following 
information  in  the  knowledge  base: 

(1)  a hierarchical  classification  of  manifestations 
(defects) ; 

(2)  a list  of  significant  manifestations  for  each  disorder 
(defect  cause) ; 

(3)  the  interrelationship  between  disorder  and  manifestations 
with  weight  factors; 

(4)  information  about  manifestation  and  disorder  history. 

In  the  control  generation  phase,  the  system  will  display 

the  diagnostic  result,  i.e.  the  most  plausible  defect  causes, 
and  recommend  the  process  control  procedure  based  on  the  pre- 
stored control  procedure  file  via  user  interactive  interface. 
Furthermore,  by  evaluating  the  feedback  results  from  the  next 
observation,  the  system  decides  the  necessity  of  further 
process  control  and  updates  the  weight  factor  between  defects 
and  their  causes. 

In  the  problem  identification  phase,  the  diagnostic 
reasoning  process  generally  requires  a large  amount  of  numeric 
and  symbolic  computation.  Thus,  one  of  our  main  concerns  is  to 
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design  a novel  diagnostic  mechanism  for  more  general,  complex 
applications  by  effectively  reducing  the  computational 
complexity  involved  in  real-world  diagnostic  problem  solving. 

2.3  Computational  Complexity 
Involved  in  General  Diagnostic  Problems 

Depending  on  the  breadth  of  the  domain  of  a diagnostic 
system  and  on  its  degree  of  refinement,  the  number  of 
disorders  hypothesized  could  range  from  a few  to  many 
thousand.  However,  studies  show  that  human  diagnosticians 
generate  only  a small  number  of  hypotheses  at  any  one  time 
during  the  diagnostic  process  (Patil  [1987]).  Similarly, 
limiting  the  number  of  hypotheses  simultaneously  considered  by 
the  diagnostic  system  at  any  one  time  has  significant 
advantages.  Focusing  our  attention  on  a small  number  of 
relevant  hypotheses  (or  disorders)  saves  a significant  amount 
of  computation  over  continually  re-evaluating  all  possible 
hypotheses.  Thus,  we  can  devote  a large  share  of  computational 
resources  to  each  of  the  hypotheses  considered  and  can  employ 
more  computationally  intensive  strategies  in  evaluating 
individual  hypotheses. 

When  we  consider  general  diagnostic  problems, 
particularly  in  situations  where  multiple  disorders  can  be 
present  simultaneously  and  where  the  assumption  for  the 
hypotheses  independence  cannot  be  justified,  limiting  the 
number  of  active  hypotheses,  i.e.  the  number  of  the  hypotheses 
under  consideration,  is  very  crucial  to  make  the  diagnostic 
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reasoning  process  computationally  tractable. 

How  can  a reduction  in  the  number  of  hypotheses  under 
consideration  be  achieved  in  the  diagnostic  system  without 
sacrificing  performance  ? That  is,  the  primary  objective  of 
this  dissertation  is  to  develop  a new  diagnostic  mechanism  for 
general  diagnostic  problems  by  effectively  reducing  the  number 
of  hypotheses  considered  and  by  focussing  attention  on  a small 
number  of  hypotheses  to  differentiate  these  relevant 
hypotheses.  In  the  next  section,  we  closely  examine  the 
computational  complexity  involved  in  a multiple  disorders 
problem. 

Multiple  Disorders  Problems 

Most  of  the  previous  work  considered  single  disorder 
problems  by  assuming  that  only  one  disorder  can  be  present  at 
a time,  i.e.,  by  assuming  all  the  possible  disorders  are 
mutually  exclusive.  In  real-world  problems,  however,  multiple 
simultaneous  disorders  can  be  present,  either  because  several 
independent  disorders  are  present,  or  because  one  disorder  may 
induce  or  complicate  another. 

The  multiple  disorders  problem  is  often  referred  to  as  a 
multimembership  classification  problem  for  diagnosis  where 
manifestations  may  be  due  to  several  disorders. 
Multimembership  classification  problems  have  been  recognized 
to  be  very  difficult  to  solve  (Ben-Bassat  [1980],  Peng  and 
Reggia  [1987a]).  Because  the  set  of  2 1 D I diagnosis  hypotheses 
is  very  large  in  many  real-world  problems,  where  D represents 
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the  set  of  all  possible  disorders,  most  existing  diagnostic 
problem-solving  methods  solve  such  multimembership  problems  in 
the  following  way.  First,  since  it  is  computationally 
intractable  to  apply  Bayes'  theorem  directly  to  2 lD I mutually 
exclusive  hypotheses,  Bayes'  theorem  is  employed,  under 
certain  assumptions,  to  calculate  the  probabilities  that  the 
given  case  belongs  to  each  individual  disorder  class,  i.e.  to 
calculate  P ( d i | QM ) for  each  disorder  d;  where  QM  denotes  the 
observed  manifestations.  Then,  based  on  these  possibilities, 
some  nonprobabilistic  means  (often  ad  hoc)  may  be  used  to 
determine,  as  a whole,  to  which  set  of  disorder  classes  the 
given  case  most  likely  belongs,  i.e.,  to  construct  the  most 
likely  hypothesis  configurations.  These  methods  are  not 
generally  satisfactory  because  1)  under  the  given 
manifestations  a case  belonging  to  one  disorder  class  is  not 
independent  of  it  belonging  to  another  disorder  class,  and 
therefore  the  final  solution  may  be  misled  by  the 
probabilities  P(df  |Qm)  for  individual  disorder  classes;  2) 
assumptions  made  in  these  methods  (e.g.,  symptoms  are 
independent  or  conditionally  independent  under  any  d;)  are 
unreasonable  in  general. 

Figure  2.2  shows  all  possible  sets  of  2 lD  I diagnosis 
hypotheses  where  {a,b,c,d}  is  a set  of  possible  hypotheses  for 
disorder  candidates  and  |D|  = |{a,b,c,d)|  =4.  In  fact,  if  no 
manifestation  may  occur  without  some  disorders,  the  number  of 
possible  disorder  candidates  is  15,  i.e.,  24  - 1,  in  this 


26 


{a,b,c,d} 


Figure  2.2  Possible  Hypotheses  Space  for  |D|  =4 
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example.  As  we  can  see  in  this  example,  the  computational 
complexity  involved  in  a multiple  disorders  problem  is 
exponentially  increased  as  the  number  of  disorders  increases. 
Thus,  to  effectively  solve  general  diagnostic  problems,  many 
efforts  have  been  made  to  seek  a systematic  means  to  reduce 
computational  complexity  through  shrinking  the  hypotheses 
space  being  considered.  As  discussed  before,  the  numeric 
method  such  as  Bayesian  theorem  is  not  directly  applicable  to 
general  diagnostic  problems  including  a multiple  disorder 
problem.  Thus,  various  methods  have  been  attempted  to 
effectively  integrate  numeric  measures  of  uncertainty  and 
symbolic  structural  knowledge  for  more  general  diagnostic 
problem  solving.  In  the  next  section,  following  the  discussion 
of  the  integrated  model,  previous  efforts  for  coupling  the 
numeric  reasoning  with  the  symbolic  reasoning  will  be 
discussed. 


2.4  Integrated  Model  for  Diagnostic  Reasoning: 

Both  Symbolic  and  Numeric 

Roughly  speaking,  there  are  two  different  kinds  of 
knowledge  utilized  by  most  real-world  diagnostic  expert 
systems;  symbolic  knowledge  and  numeric  knowledge  (Kitzmiller 
and  Kowalik  [1986],  Peng  and  Reggia  [1987a]).  Symbolic 
knowledge,  usually  based  on  cognitive  and  qualitative  models, 
specifies  which  entities  are  associated  by  what  kind  of 
association.  Among  all  kinds  of  associations,  cause-effect 
relations  are  the  most  important  in  diagnostic  problem 
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solving.  Numeric  knowledge,  on  the  other  hand,  reflects  the 
uncertainties  involved  in  such  real-world  symbolic 
associations.  Numeric  knowledge  typically  specifies  the 
strengths  of  association  and  prior  probabilities  of  individual 
entities . 

Traditionally,  reasoning  based  on  numeric  knowledge  has 
provided  the  user  a great  deal  of  accuracy,  but  none  of  these 
insights.  In  order  to  solve  many  problems  in  complex  systems, 
both  insight  (from  symbolic,  structural  knowledge)  and 
precision  (from  numeric,  quantitative  knowledge)  are  often 
required.  In  many  instance,  insight  into  problem  solving  must 
be  gained  in  order  to  obtain  a solution  in  a best  way. 

It  has  also  been  pointed  by  many  experts  that  even  if  we 
have  a complete,  well-defined  numerical  model  of  a target 
domain,  this  by  itself  may  be  insufficient  for  many  tasks, 
since  symbolic,  qualitative  reasoning  is  required  to  interpret 
the  numeric  values  of  the  various  problem  variables.  Thus  the 
solution  of  complex  systems  often  compels  us  to  switch  back 
and  forth  between  numeric  and  symbolic  reasoning. 

Szolovit  and  Pauker  explain  why  such  reasoning  is 
necessary  in  the  complex  diagnostic  problem,  for  instance,  in 
medicine : 

Why  are  categorical  decisions  not  sufficient  for  all  of 
medicine  ? Because  the  world  is  too  complex  ! Although 
many  decisions  may  be  made  straightforwardly,  many  others 
are  too  difficult  to  be  prescribed  in  any  simple  manner. 
While  many  factors  may  enter  into  a decision,  when  those 
factors  may  themselves  be  uncertain,  when  there  is  a 
significant  cost  associated  with  gathering  information 
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that  may  not  actually  be  required  for  the  decision,  then 
the  rigidity  of  the  flow  chart  makes  it  an  inappropriate 
decision-making  instrument  (Szolovit  and  Pauker  [1978], 

pp.  118)  . 

2.4.1  Symbolic  Model  for  Diagnostic  Reasoning 

Diagnostic  problems  have  often  been  cast  into  a pattern 
recognition  or  statistical  decision  theory  framework.  Many 
well-known  methods  such  as  those  based  on  Bayes'  theorem  have 
been  used  (Weiss,  Kulikowski,  Amarel,  and  Safir  [1978],  Tou 
[1982],  Patrick  and  Fattu  [1986]).  The  difficulties  with 
directly  applying  these  methods,  such  as  scarcity  of 
statistics,  the  use  of  invalid  approximations,  and  the 
combinatorial  explosion  for  multiple  disorders,  are  also 
sufficiently  persistent  that  alternative  approaches  have  been 
sought. 

In  the  past  ten  years,  there  has  been  increased  interest 
in  the  application  of  knowledge-based  symbolic  processing 
models  to  diagnostic  decision-making.  This  symbolic  model 
attempts  to  capture  decision  making  rules  explicitly,  while 
statistical  methods  may  extract  them  implicitly  from 
accumulated  sample  experience.  The  symbolic  approaches  intend 
to  overcome  some  of  the  limitations  of  purely  statistical 
methods  by  developing  a more  structured  representation  of  the 
diagnostic  problems.  This  structured  knowledge,  usually  in 
symbolic  form,  specifies  which  entities  are  associated  by  what 
kind  of  associations.  Among  all  kinds  of  associations,  cause- 
effect  relations  are  the  most  important  in  the  diagnostic 
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problem  solving,  which  can  be  formulated  by  the  well-known 
causal  model. 

Causal  model  for  diagnosis.  In  diagnostic  problem 
solving,  the  causal  model  is  the  important  knowledge  which 
describes  the  causal  relationships  between  the  possible 
disorders  (causes,  or  disease)  and  possible  symptoms  (defects, 
or  manifestations) . Two  of  the  distinguishing  characteristics 
of  the  causal  model  are  (Hudlicka  [1988]): 

(1)  That  it  encodes  the  structure  and  behavior  of  the  system 
it  represents,  making  explicit  the  causal  relationship 
among  the  system  components  and  behavioral  states,  and 

(2)  That  it  contains  only  domain  knowledge.  No  control 
knowledge  to  implement  a particular  type  of  reasoning  is 
included  in  the  model. 

Many  representational  formalisms  are  used  for  causal  models. 
These  include  predicate  calculus  (Genesereth  [1985]), 
constraint  networks  (Davis  [1985]),  qualitative  equations 
(Forbus  [1985]),  and  causal  association  networks  (Weiss, 
Kulikowski,  Amarel,  and  Safir  [1978]) . Among  those  schemes,  we 
choose  the  causal  association  network  as  the  knowledge 
representation  scheme  for  the  diagnostic  reasoning  because  of 
its  capabilities  of  effectively  representing  multiple  disorder 
problems  and  of  easily  updating  the  causal  relations. 

2*4.2 Numeric  Model  for  Diagnostic  Reasoning 

The  drawback  of  symbolic  reasoning  is  that  it  often 
generates  ambiguous  results.  In  fact,  the  numeric  reasoning 
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approach  inherently  provides  more  accurate  tools  for  the 
diagnostic  reasoning.  A number  of  formal  schemes  for  the 
weighting  of  evidence,  in  a numeric  form,  have  been  developed 
and  we  shall  concentrate  on  one  of  them,  the  probabilistic,  to 
contrast  it  with  the  above  discussed  symbolic  reasoning.  We 
believe  that,  with  appropriate  limitation,  probabilistic 
reasoning  can  be  an  appropriate  component  of  computerized 
fault  diagnosis  and  supervisory  control  in  a distributed 
process  environment. 

The  failure  of  the  pure  probabilistic  reasoning  lies  in 
its  voracious  demand  for  data  (Szolovits  and  Pauker  [1978]). 
Because  of  the  distortions  that  the  pure  probabilistic  scheme 
imposes  on  the  problem  and  because  of  the  enormous  data 
requirements  it  implies,  it  tends  to  be  used  successfully  only 
in  small,  well-constrained  problem  domains. 

2.4.3  Coupling  Symbolic  and  Numeric  Reasoning 

Separately,  neither  the  symbolic  nor  the  numeric  approach 
can  successfully  address  all  problems  in  complex  domains. 
Complex  problems  such  as  the  fault  diagnosis  in  a distributed 
process  environment  cannot  effectively  be  solved  by  purely 
symbolic  or  numeric  reasoning  techniques  because:  (1)  purely 
symbolic  reasoning  may  hypothesize  a set  of  disorder 
candidates  based  on  its  structural  knowledge,  but  cannot 
precisely  rank  or  differentiate  a set  of  disorder  candidates 
due  to  its  lack  of  accurate  tools,  and  (2)  purely  numeric 
reasoning  may  provide  an  accurate  means  to  solving  uncertainty 
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problems,  but  may  require  too  much  computation  complexity 
which  leads  to  unreasonable  assumptions.  In  such  instances  a 
mix  of  numeric  and  symbolic  techniques  is  needed  to 
effectively  obtain  a solution.  Therefore,  comprehensive 
reasoning  systems  of  diagnosis  can  be  achieved  by  integrating 
two  reasoning  approaches  into  a single  framework  for  more 
general,  complex  problem  solving.  In  fact,  the  integrated 
model  can  provide  more  systematic  means  to  reduce  the 
computational  complexity  by  effectively  limiting  the  number  of 
hypotheses  considered.  The  symbolic  knowledge  can  eliminate 
some  difficulties  in  directly  applying  the  numeric  measure  to 
all  the  possible  hypotheses. 

2.4.4  Previous  Work  for  Integrating  Numeric  and  Symbolic 
Reasoning 

It  has  long  been  recognized  in  the  field  of  machine 
intelligence  that  there  is  a strong  need  for  effectively 
integrating  numeric  measures  of  uncertainty  and  symbolic 
structural  knowledge  in  expert  systems  (Peng  and  Reggia 
[1987],  Szolovits  and  Pauker  [1978],  Tou  [1982]).  Especially, 
a diagnostic  expert  system  needs  this  integration  for  its 
reasoning  scheme  to  effectively  reduce  the  computational 
complexity.  To  accomplish  this  goal,  a number  of  methods  have 
been  attempted.  Examples  include  the  use  of  "certainty 
factors"  with  production  rules  as  originated  in  MYCIN  (Davis, 
Buchanan,  and  Shortliffe  [1977]),  various  applications  of 
fuzzy  set  theory  (Zimmermann,  Zadeh,  and  Gaines  [1984]),  the 
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Dempster-Shafer  theory  for  ranking  competing  hypotheses 
(Gordon  and  Shortliffe  [1985]),  the  model  for  combining 
Bayesian  decision  theory  with  rule-based  deduction  in 
PROSPECTOR  (Duda,  Gaschnig,  and  Hart  [1979]),  and  the 
probabilistic  causal  model  by  integrating  symbolic  causal 
inference  with  numeric  probabilistic  inference  (Peng  and 
Reggia  [1987a],  [1987b]).  The  pattern-directed  approach 
employed  in  MEDIKS,  APRIKS , and  AVIS  (see  Section  2.5)  is  also 
one  of  these  efforts  by  combining  statistical  pattern 
recognition  techniques  with  its  content-associative  knowledge 
hierarchy.  None  of  these,  or  most  other  previous  work,  is 
fully  adequate  for  use  in  general  diagnostic  problem  solving, 
particularly  in  situations  where  the  underlying  knowledge  is 
represented  primarily  as  causal  associations  rather  than 
production  rules,  where  multiple  disorders  can  be  present 
simultaneously,  and  where  the  assumption  for  the  hypotheses 
independence  (i.e.  disorders  independence  and/or 
manifestations  independence)  cannot  be  justified. 

Peng  and  Reggia' s work  (Peng  and  Reggia  [1987a],  [1987b]) 
has  been  considered  one  of  the  most  general  and  comprehensive 
approaches  to  integrating  probabilistic  inference  with  a 
symbolic  model  of  diagnostic  reasoning  based  on  causal 
associations.  This  approach  provides  a formal  probabilistic 
causal  model  that  integrates  Bayesian  classification  with  a 
domain— independent  artificial  intelligence  model  of  diagnostic 
problem  solving,  so  called  "parsimonious  covering  theory".  To 
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solve  very  complex  real-world  problems,  however,  the 
generality  and  robustness  of  this  integrated  model  should  be 
expanded  by  considering  several  critical  issues  in  diagnostic 
problem  solving,  such  as  hierarchical  knowledge 
representation,  disorder  dependence,  cardinality  constraint, 
and  categorical  causal  reasoning. 

Tou's  PADIKS  approach  is  powerful  in  solving  diagnostic 
problems  with  taxonomic  properties,  which  significantly  reduce 
the  computational  complexity  by  considering  just  a small 
fraction  of  subcategory  at  a time,  mainly  due  to  its 
hierarchical  knowledge  representation.  However,  this  approach 
is  inefficient  to  handle  multiple  simultaneous  disorders  and 
assumes  the  manifestation  independence  so  that  the  final 
solution  may  be  misled  by  the  wrong  posterior  probabilities 
for  individual  disorders  in  some  application  domains. 

By  closely  examining  the  above  two  respectable 
approaches,  we  have  found  what  improvements  and  extensions  are 
needed  for  more  general  diagnostic  problem  solving.  Our  main 
concern  in  this  comparative  work  is  to  show  how  they  reduce 
the  burden  of  the  computational  complexity  in  a general 
diagnostic  problem  (or  how  they  limit  the  number  of  hypotheses 
considered) . 

Peng  and  Reggia's  work.  Combined  with  the  set  covering 
theory,  the  causal  knowledge  is  utilized  to  hypothesize  a 
small  number  of  the  disorder  candidates  which  explain  all  the 
observed  manifestations.  However,  this  work  does  not  have  the 
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efficient  representation  scheme  of  causal  knowledge  such  as 
the  hierarchical  knowledge  representation.  Also,  the  causal 
model  using  set  covering  theory  may  generate  a unreasonably 
large  number  of  disorder  candidates  and  may  hypothesize  sets 
of  non-plausible  multiple  simultaneous  disorders  due  to  its 
relying  on  the  binary  causal  relations. 

PADIKS  approach.  Combining  with  its  knowledge  sketch- 
detail  scheme,  the  PADIKS  approach  makes  use  of  its  content- 
associative  hierarchy  for  a causal  knowledge.  This 
representation  scheme  allows  the  system  to  focus  on  a small 
fraction  of  the  causal  knowledge  at  a time.  However,  this 
approach  cannot  hypothesize  more  than  one  disorder  in  each 
category  because  it  assumes  all  the  disorders  in  a certain 
category  are  mutually  exclusive.  To  solve  a multiple  disorders 
problem,  this  mutual  exclusiveness  assumption  should  be 
eliminated  so  that  the  PADIKS  approach  is  applicable  to  more 
general  diagnostic  problem  solving.  Since  the  PADIKS  approach 
has  successfully  been  applied  to  several  application  domains 
and  the  new  diagnostic  mechanism  proposed  in  this 
dissertations  aims  to  improve  the  PADIKS ' s diagnostic 
capability,  we  elaborately  discuss  the  concept  of  the  PADIKS 
approach  in  the  next  section. 

2.5  Pattern-Directed  Approach 

The  major  approaches  to  the  design  of  knowledge-based 
expert  systems  are:  (1)  the  rule-driven  approach,  or  (2)  the 
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pattern-directed  approach  (Tou  [1985a],  [1985b]).  The  rule- 
driven  approach  makes  use  of  a collection  of  "IF-THEN"  rules. 
A typical  rule-driven  expert  system  is  the  highly  publicized 
MYCIN  system  (Davis,  Buchanan,  and  Shortliffe  [1977]).  Unlike 
the  conventional  approaches,  the  pattern-directed  approach  is 
based  upon  the  construction  of  pyramid-like  networks  of  know- 
hows which  we  refer  to  as  a knowledge  hierarchy. 

2.5.1  Pattern-Directed  Knowledge-Based  System  fPADIKS) 

A knowledge-based  expert  system  may  be  designed  on  the 
basis  of  the  principle  of  knowledge-based  pattern  recognition. 
Such  design  is  referred  to  as  a pattern-directed  approach  to 
expert  systems.  Traditionally,  pattern  recognition  studies  are 
concerned  with  relatively  small  numbers  of  pattern  classes 
that  are  characterized  by  a small  set  of  features.  In  recent 
years,  attention  has  been  turned  to  a second  type  of  pattern 
recognition  problem  which  deals  with  very  large  numbers  of 
pattern  classes  characterized  by  extremely  large  sets  of 
attributes  (Tou  [1982],  [1985a]).  Approaches  to  solve  this 
type  of  problem  makes  use  of  a knowledge  base  and  are  referred 
to  as  knowledge-based  pattern  recognition,  which  has  a salient 
feature  of  involving  very  large  numbers  of  classes  and 
attributes.  Examples  include  diagnostic  problems,  design 
problems,  troubleshooting  problems,  image  understanding 
problems,  and  robot  vision  problems. 
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2.5.2  Knowledge  Representation 

Knowledge  is  information  about  the  world  which  allows  an 
expert  to  make  decisions.  This  knowledge  needs  to  be 
represented  and  employed  in  a form  that  can  be  used  for 
reasoning.  The  process  of  representing  this  knowledge  formally 
is  called  knowledge  representation.  Just  as  data  structures 
are  used  to  store  and  deal  with  data,  knowledge  structures  are 
used  to  store  knowledge  and  reason  with  it.  The  type  of 
knowledge  representation  that  is  appropriate  in  a given  domain 
depends  on  what  sort  of  knowledge  is  being  represented  and  how 
it  is  to  be  applied.  Many  representational  formalisms  have 
been  proposed,  such  as  semantic  networks,  production  rules, 
frames,  conceptual  dependency  structures,  and  scripts 
(Lingarkar,  Liu,  Elbestawi,  and  Sinha  [1990]). 

An  important  feature  of  the  knowledge  base  in  the  PADIKS 
approach  is  the  semantically-organized  category  file  which  is 
built  upon  a content  associative  hierarchy  tree  structure.  In 
the  associative  hierarchy,  each  node  represents  an  entity 
associated  with  a set  of  attributes  and  values  which  form  the 
information  patterns  describing  the  knowledge  needed  for 
diagnostic  reasoning  (Tou  and  Depree  [1978],  Tou  [1985a]). 

The  structured  knowledge  base  is  composed  of  a knowledge 
sketch  and  knowledge  details.  The  knowledge  sketch  represents 
the  outline  relational  structure  and  the  problem-oriented 
hierarchies  of  the  stored  domain  knowledge.  The  most  general 
concepts  are  placed  at  the  top  of  the  tree  and  the  most 
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specific  items  at  the  bottom  of  the  hierarchy.  Knowledge 
details,  on  the  other  hand,  contain  various  data  files,  which 
mainly  consist  of  the  leaf  nodes  in  the  associative  tree. 
Entity-relation  (ER)  structure 

In  the  PADIKS  approach,  the  domain  knowledge  is 
represented  in  terms  of  three  types  of  primitives;  (1) 
entities,  (2)  attributes  and  (3)  relations.  Entities  are  types 
of  information  which  are  tied  together  by  a set  of  relations 
to  form  a knowledge  sketch.  Attributes  are  characteristics  of 
other  knowledge  entities.  Attributes,  when  associated  with  a 
particular  entity  occurrence,  may  be  valued  or  weighted.  The 
characteristic  attributes  for  each  entity  occurrence  may  be 
viewed  as  a pattern  vector,  which  may  be  used  as  a basis  for 
decision-making  and  matching.  The  interrelationship  between 
entities  are  called  relations.  The  set  of  relations  virtually 
determines  the  overall  structure  of  a knowledge  domain.  Entity 
primitives  are  abstracted  knowledge  with  the  associated 
realistic  sense  being  a function  of  the  domain  properties  as 
well  as  problem-solving  strategies.  Entities  can  be  decomposed 
into  components  which  are  more  elementary  in  the  sense  that 
each  of  them  has  a more  homogeneous  conceptual  meaning.  The 
decomposition  of  entities,  in  turn,  defines  the  third  type  of 
primitives,  relations,  denoting  the  epistemic  structure  of  the 
knowledge  domain. 

For  instance,  in  an  Earth  science  data  and  information 
system  we  may  have  relations  such  as; 
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OBSERVATION  (observation_number , instrument_name,  object, 
version,  region,  observing_period,  date,  time, 

• • • • ) 

INSTRUMENT  ( instrument_name , measurement,  resolution, 
period,  region,  calibration  information,  ....) 

Here,  entities  are  OBSERVATION  and  INSTRUMENT;  attributes  are 
observation_number , version,  and  all  other  terms  in  lower  case 
letters;  and  finally  a relation  is  that  the  entity 
"OBSERVATION"  is  measured  by  entity  "INSTRUMENT",  because 
" instrument_name"  is  a key  attribute  of  the  entity  INSTRUMENT. 

In  order  to  build  a knowledge  base  which  has  entity- 
relation  structure,  first,  each  entity  and  its  attributes  are 
defined  with  a set  of  possible  values  for  each  attribute, 
i . e . , 


Ef  - (An  (Vj^vf^V?,,  . .)  , Ai2(V]2,V?2,V?2,  . . ) Ain(Vjn,v2n 


where  E;  denotes  the  ith  entity  in  a given  domain;  A;j 
the  jth  attributes  of  the  entity  E( ; and  V*.  is 
possible  value  of  the  attribute  Af j . 


' Mn ' * * ) 


denotes 
the  kth 


Example  2 . 1 

Let  us  consider  a simple  example  for  an  entity-relation 
structure  in  an  Earth  science  data  and  information  system. 


Entity  (Ef ) : OBSERVATION 

Attribute  {A^}  : An:  observation_number 

Aj2:  instrument_name 
Aj3:  object 
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Aj4:  version 

Aj5:  region 

Aj6:  observing  period 

Aj7:  date 

Aig:  time 

Aj9:  spectral  resolution 


We  may  have  a set  of  examples  for  attribute  values. 


Value  {V^.} 


Vn:  soil  moisture 

Vj2:  microwave  radiometer 

Vj3:  earth 

Vj4:  the  14th  version 
Vj5:  Gainesville,  Florida 
Vj6:  2 days 
Vj7:  1990:10:10 
Vjg:  21:23:34 
Vj9:  20  cm  ± 1 cm 


Entitv-cateqorv-relation  (ECR)  structure 

The  ECR  structure  is  the  extended  version  of  EAR 
structure  for  knowledge  base  representation,  which  introduces 
the  category  concept.  The  concept  of  category  provides  more 
hierarchical  knowledge  representation  by  considering  the 
categorical,  hierarchical  relationships  between  a set  of 
entities  in  a given  application  domain.  In  our  application 
domain  of  a PCB  manufacturing  operation,  we  represent  various 
types  of  defects  with  the  classif icatory  property  by  using  the 
entity-category-relation  structure  as  shown  in  Table  2.1, 
where  categories  are  subsets  of  entities.  In  Table  2.1,  the 
hierarchical  relationships  between  entities  can  easily  be 
identified  by  examining  the  category  code  given  to  each 
entity.  We  transform  this  table  form  of  the  category  structure 
into  the  graphic  representation  as  shown  in  Figure  2.3  which 
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efficiently  illustrates  the  categorical  classification  of  the 
possible  manifestations  in  a PCB  manufacturing  operation. 

2.5.3  Diagnostic  Model  in  PADIKS 

In  the  PADIKS  approach  to  diagnostic  problem  solving, 
knowledge  is  represented  by  an  associative  tree  structure  that 
is  designed  to  facilitate  knowledge  retrieval  (Tou  [1978], 
Chang  and  Tou  [1984]).  Two  types  of  elements  are  used  for 
general  diagnostic  problems:  disorder  and  manifestation. 
Disorders  are  organized  to  form  a hierarchical  category 
structure,  starting  from  the  top  level  with  most  general 
disorders  to  the  bottom  level  with  the  most  specific 
disorders.  For  example,  in  a PCB  manufacturing  operation,  the 
most  general  disorders,  such  solder  defect,  installation 
defect,  coating  defect,  etc,  are  at  the  top  level.  On  the  other 
hand,  the  most  specific  disorder,  such  as  excess  solder, 
pinholes,  solder  bridge,  missing  solder,  etc,  are  at  the 
lowest  level. 

Corresponding  to  each  disorder,  there  is  a set  of 
manifestations  that  are  used  to  characterize  the  disorder. 
Each  manifestation  has  a weighting  factor  W1  to  indicate  its 
occurrence  frequency  when  a given  disorder  is  present.  Another 
weighting  factor  W2,  indicating  the  occurrence  frequency  of  a 
disorder  when  a given  manifestation  is  observed,  can  be  also 
recorded.  These  data  provide  the  relationships  between 
disorders  and  manifestations.  Since  disorders  are  organized  in 
a category  tree  structure  in  PADIKS,  the  manifestations  of  a 
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Table  2.1  Part  of  the  Category  File  for  PCB  Defects 


Category  Code 

Disorder  Name 

1 

1.1 
1.1.1 
1. 1.1. 1 
1. 1. 1.2 
1. 1.1.3 

Test  data 
Component  test 
Installation 
Wrong  location 
Missing  component 
Wrong  polarity 

1.1.2 
1.1. 2.1 
1.1. 2. 2 

1. 1. 2 . 3 

1.1.3 

1. 1. 3 . 1 

1.1. 3. 2 

1.1. 3. 3 
• 

Wrong  component 

Type 

Value 

Material 

Damaged 

Chipped  component 
Cracked  component 
Broken  component 

• 

1.2 
1.2.1 
1.2. 1.1 
1.2. 1.2 

1.2. 1.3 

1.2. 1.4 

1.2. 1.5 

1.2. 1.6 
1.2. 1.7 

Solder  test 
Solder 

Excess  solder 
Pinholes 
Solder  bridge 
Missing  solder 
Fractured  solder  joint 
Insufficient  solder 
Solder  bubbles 

1.3 

1.3.1 

1. 3 . 1. 1 

1. 3 . 1. 2 
• 

In-circuit  test 
Electric  defect 
Resistance 
Capacitance 

• 

1.3.2 

1. 3.2.1 

1.3. 2. 2 

1.3. 2. 3 

1.3. 2. 4 

1.3. 2. 5 

Tolerance 

Shorted 

Open 

Intermittent 

Voltage 

Frequency 
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1 


1. 1.2.1  1. 1.2.2  1. 1.2.3  1.2.1. 1 1.2.1. 2 1.2.1 .3  1.3.2.1  1.3.2.2 


Figure  2.3  Graphic  Representation  of  the  Category  File 
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nonleaf  node  disorder  can  be  obtained  by  logically  ORing  all 
the  manifestations  of  its  subnode  disorders.  And  their 
weighting  factors  W^s  are  calculated  by  algebraically  summing 
the  weighting  factors  W, ' s of  all  the  corresponding 

manifestations  of  the  subnode  disorders,  if  all  the  primary 
weighting  factors  W, 1 s of  the  manifestations  for  every  subnode 
disorder  have  been  normalized  to  one.  The  weighting  factors 
W2's  can  be  obtained  by  Bayes  rule. 

Let  D and  M be  the  set  of  disorders  and  manifestations 
under  consideration,  i.e., 

D = {d.) , i = 1,  2,  . . . , 1 

and, 

M = { ) , j = 1 , 2 , . . . , k 

where  1 and  k are  the  number  of  possible  disorders  and 
manifestation,  respectively.  Corresponding  to  disorder  d. , 
there  is  a set  of  manifestations  Mf  c M,  called  a pattern 
vector.  Since  the  disorders  form  a hierarchical  tree,  for 
disorder  df  which  is  not  in  the  lowest  level,  there  is  a set 
of  subnodes  Dj  = { ci . j } c D.  Each  subnode  disorder  is 
characterized  by  a set  of  manifestations,  i.e.,  Mfj.  is  a 
pattern  vector  corresponding  to  the  subnode  disorder  d(j  of  the 
parent  disorder  df . 

The  priori  probability  p(d.)  is  defined  as  the 
probability  of  the  appearance  of  disorder  df  in  the  population 
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under  consideration.  The  conditional  probability  P(mj  | df ) is 
defined  as  the  conditional  probability  of  the  manifestation 
m. , if  the  disorder  df  is  present.  P(di ) 's  and  P(m.  | d; ) 's  can  be 
obtained  from  history  data  and  record  in  quality  control  and 
expert 1 s knowledge . 

Let  Q = {q,}  c M be  a query  vector  which  is  a set  of 
observed  manifestations.  The  diagnostician  always  tries  to 
find  disorders  that  can  explain  the  observed  manifestation  in 
the  best  way.  In  general,  the  diagnostic  process  can  be 
carried  out  by  comparing  the  similarities  of  all  disorders. 
The  disorder  with  greater  similarity  is  more  likely  to  be  the 
one  causing  similar  responses.  In  practical  cases,  the 
observed  manifestations  may  be  caused  by  a single  disorder  or 
by  several  disorders  that  may  be  distributed  in  different 
disorder  categories.  Since  disorders  are  organized  to  form  a 
hierarchical  tree  category  structure  in  the  PADIKS  approach, 
the  final  disorder  causing  the  observed  manifestations  can  be 
obtained  by  traversing  all  the  possible  category  subtrees.  So 
the  diagnostic  process  can  be  simplified  to  an  iterative 
process  of  selecting  all  the  possible  disorders  under  a 
category  subtree  in  the  PADIKS  approach. 

For  any  given  category  disorder  node,  d{ , the  top  N 
possible  disorders  can  be  determined  by  calculating  the 
posterior  probabilities,  i.e.,  similarities,  of  all  the 
subnode  disorders,  Df  = {dfj. },  and  then  classifying  these 
subnode  disorders  to  find  the  most  plausible  (or  probable) 
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disorder  class.  The  posterior  probability  equation  used  in  the 
PADIKS  approach  is 


P(dj|Q) 


P (Q I d,- ) P(d,) 

P(Q) 


I 

dijcdi 


P (Q I dfj)  P(dr>) 
P(Q) 


(2.1) 


Equation  (2.1)  assumes  that  all  the  manifestations  are  chosen 
such  that  they  are  mutually  exclusive  events  in  the 
manifestation  space  M,  that  is, 

P(Q)  = Y.  p (Q I dk)  P(dk)  (2‘2) 

k=1 


and 


P(Q|dfi)  = 


n 


mte  qD 


M, 


p (mi  | d,- j) 


(2.3) 


ij 


From  (2.1) , the  relative  magnitudes  of  posterior  probabilities 
for  existence  of  disorders  can  be  obtained  by  comparing  their 
importance . 

The  classification  techniques,  such  as  the  K-means 
algorithm,  are  used  for  the  disorder  clustering.  After  all  the 
hypothesized  disorders,  which  are  the  selected  possible  leaf 
node  disorders,  have  been  found,  a comparison  is  conducted 
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between  the  observed  manifestations  and  those  in  the  final 
hypothesized  disorder. 

We  have  discussed  the  knowledge  representation  scheme  and 
the  inference  scheme  in  the  PADIKS  approach  which  was  designed 
at  the  Center  Information  Research.  Based  on  this  discussion, 
we  point  out  some  problems  existing  in  the  current  PADIKS 
approach  in  the  following  section  that  provide  the  room  for 
our  improvement. 

2.5.4  Some  Problems  in  Current  PADIKS  Approach 

Combined  with  its  content-associative  knowledge 
hierarchy,  the  PADIKS  approach  is  powerful  in  dealing  with 
classification  problems  such  as  diagnostic  problems  which  are 
concerned  with  the  assignment  of  the  observed  features 
(manifestations)  to  a certain  class  (or  disorder) . However, 
the  PADIKS  approach  employed  in  the  previous  systems  (such  as 
MEDIKS,  APRIKS,  and  AVIS)  has  some  difficulties  in  handling 
more  general  diagnostic  problems.  Especially,  when  multiple 
disorders  can  occur  simultaneously,  the  PADIKS  approach  may 
lead  to  wrong  diagnostic  solutions  due  to  its  assumption  of 
the  mutual  exclusiveness  of  disorders.  Even  though  the  PADIKS 
approach  can  focus  on  only  a small  fraction  of  the  initial 
hypotheses  space  because  of  its  highly  efficient  knowledge 
structure,  this  multiple  disorder  problem  is  very  difficult  to 
solve  without  a more  systematic  method  in  reducing  the 
hypotheses  space.  Furthermore,  it  can  hardly  be  expected  in 
general  diagnostic  problems  that  all  the  categories  in  the 
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hierarchical  knowledge  base  have  small  sets  of  subcategories 
where  multiple  disorder  problems  are  relatively  easy  to  solve. 

2.5.5  Improvements  on  the  PADIKS  Approach 

In  this  dissertation,  we  draw  on  the  advantages  of  the 
PADIKS  approach  such  as  its  knowledge  representation  and 
pattern-directed  inference  scheme,  but  modify  this  approach  to 
effectively  handle  the  multiple  disorders  problem  by  removing 
the  constraints  imposed  on  hypotheses  such  as  the  mutually 
exclusive  disorders  assumption  and  the  manifestation 
independence  assumption. 

We  propose  a new  diagnostic  mechanism,  called  "multi- 
level diagnostic  mechanism",  which  consists  of  three 
distinctive  levels  of  the  diagnostic  reasoning  procedures, 
i.e.,  the  model-based  candidate  generator,  the  constraint- 
based  filter,  and  the  pattern-directed  ranker.  First,  in  the 
model-based  candidate  generator,  the  parsimonious  covering 
theory  is  combined  with  the  categorical  knowledge 
representation  scheme.  Thus,  the  categorical  causal  model  is 
designed  to  generate  a set  of  the  disorder  candidates  which 
explain  all  the  observed  manifestations.  In  fact,  this  model 
is  the  combination  of  the  PADIKS  approach  and  Peng  and 
Reggia's  work,  which  can  solve  a multiple  disorders  problem 
while  focusing  on  a small  fraction  of  subcategory  at  a time. 
To  further  limit  the  number  of  disorder  candidates  and  to 
eliminate  non-plausible  disorder  candidates,  three  constraints 
on  the  multiple  simultaneous  disorders  hypothesized  are 
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designed  in  the  constraint-based  filter.  Finally,  the  pattern- 
directed  ranker  perforins  the  differential  diagnosis  on  a small 
subset  of  the  initial  hypotheses  space  so  that  more 
computationally  intensive  approaches  can  be  used  by  removing 
the  unreasonable  assumptions  which  have  been  introduced  to 
simplify  the  diagnostic  computation.  Basically,  this  new 
approach  aims  to  reduce  the  initial  hypotheses  space  into  a 
reasonable  (computationally  tractable)  size  and  then  to  focus 
its  computing  power  on  the  reduced  hypotheses  space  to  rank 
the  hypothesized  disorders.  The  details  of  the  proposed 
diagnostic  mechanism  will  be  discussed  in  Chapter  IV. 


CHAPTER  III 

DATA  PREPROCESSING:  DATA  INTEGRATION 
IN  DISTRIBUTED  PROCESS  ENVIRONMENTS 

3 . 1 Motivation 

As  discussed  in  Section  1.2,  a distributed  process 
environment  usually  consists  of  four  basic  components,  namely, 
the  distributed  process,  the  data  collection  system,  the 
process  control  system,  and  the  operators.  The  data  collection 
system  is  responsible  for  collecting  various  data 
(observation,  measurement,  symptom,  fault,  etc.)  from  the 
distributed  process  and  to  provide  these  data  to  the  process 
control  system  for  diagnostic  problem  solving.  The 
manifestation-based  diagnostic  inference  mechanism  in  the 
process  control  system  assumes  that  all  the  observed 
manifestations  (or  all  the  observations)  are  correctly 
measured  and  collected  when  it  performs  the  diagnostic 
reasoning.  However,  in  real  world  problems,  we  may  have 
erroneous  observations  due  to  various  reasons  such  as 
operators'  mistakes,  inaccurate  measurement  tools,  and 
parameter  deviations  (Tou  and  Chung  [1990]).  Furthermore,  in 
a distributed  process  environment,  these  observations  are 
accumulated  in  a random  fashion  within  the  database  of  the 
process  control  system  via  the  data  collection  system,  which 
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are  very  difficult  to  interpret.  Therefore,  before  performing 
the  diagnostic  reasoning  on  the  collected  observations,  a 
preprocessing  operation  is  required  to  automatically  generate 
the  error-free,  well-structured  observation  file  (or 
manifestation  file  in  a diagnostic  problem) , called  "category 
file". 

We  present  the  design  concept  of  the  data  preprocessing 
system,  called  "data  integration  system",  for  normalization, 
verification,  validation,  association  of  observation  data 
which  are  gathered  at  various  subprocesses  in  a distributed 
process  environment.  The  prime  purpose  for  this  data 
integration  system  is  to  ensure  that  all  the  observations  (in 
fact,  test  data  and  inspection  data  in  our  application  domain) 
applied  to  the  diagnostic  inference  system  are  correct  and 
properly  organized.  Because  the  observed  manifestations  are 
the  most  important  resources  of  our  manifestation-based 
diagnosis  proposed  in  this  dissertation,  erroneous 
observations  may  lead  to  wrong  diagnostic  results.  Thus  the 
data  integration  operation  should  be  designed  not  only  to 
eliminate  the  possibility  of  wrong  diagnosis  due  to  wrong 
manifestations,  but  also  to  facilitate  the  diagnostic 
reasoning  procedures  thanks  to  its  automatic  construction  of 
the  well-defined  data  hierarchy.  During  recent  years,  the 
verification  and  validation  of  observed  data  has  rightfully 
grown  in  importance  especially  for  a distributed  process 
environment  consisting  of  various,  heterogeneous  information 
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sources  which  are  hard  to  control  in  a unified  form.  However, 
there  is  no  systematic  means  to  performing  the  verification, 
validation,  association  of  observed  data  due  to  their  dynamic, 
random  characteristics.  Mostly  these  tasks  have  been  done  by 
manual  operation  which  is  very  time-consuming  and  error-prone. 
Here  we  propose  the  knowledge-based  data  integration  technique 
for  normalizing,  verifying,  validating,  and  associating  the 
data  observed  from  various  subprocesses  by  utilizing  diverse 
domain  knowledge  in  a generalized  scheme.  To  effectively 
explain  the  design  concept  of  the  data  integration  operation, 
the  test  data  collected  from  a PCB  manufacturing  operation  are 
used  for  each  illustration. 

In  a distributed  process  environment,  the  technician  at 
each  subprocess,  often  called  "workcell"  in  a manufacturing 
environment,  enters  the  observed  data  into  a central  database. 
Experience  has  told  us  that  the  database  may  be  erroneous  due 
to  wrong  entry,  wrong  unit,  wrong  format,  wrong  code,  wrong 
scale,  etc..  Thus  the  database  must  be  verified  and  validated 
before  it  becomes  useful.  The  data  in  the  validated  database 
are  then  grouped  and  categorized  for  the  well-defined 
hierarchical  structure.  To  facilitate  the  use  of  the  database, 
the  data  will  be  dynamically  reorganized  to  meet  the  needs  of 
various  users.  In  the  following  section,  we  discuss  the  design 
concept  of  a data  integration  system  in  a distributed  process 
environment,  especially  in  a PCB  manufacturing  operation,  by 
applying  it  to  the  test  data  collected  from  industries.  Thus 
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we  use  the  term  "workcell"  for  the  subprocess  in  a distributed 
process  environment  and  the  term  "test  data"  for  observations 
(or  manifestations) . 

3.2  Design  of  Data  Integration  System 

In  a distributed  process  environment,  technicians  (or 
computerized  subprocesses)  at  various  workcells  enter  test 
data  into  a central  database  via  the  data  collection  system, 
which  we  refer  to  as  the  raw  database.  Our  idea  of  data 
integration  is  to  perform  data  verification  and  data 
validation  of  the  raw  database  after  the  data  normalization 
for  easy  data  manipulation.  The  validated  data  are  then 
automatically  grouped  and  categorized  to  form  a well-defined 
hierarchical  database  for  various  applications  such  as 
diagnostic  reasoning  and  statistical  quality  control.  We  call 
this  operation  data  association.  The  design  concept  of  these 
four  basic  operations  in  data  integration,  shown  in  Figure 
3.1,  is  discussed  in  this  section. 

3.2.1  Data  Normalization 

Prior  to  data  verification,  a normalization  operation  is 
performed  to  unify  the  description  of  each  entity.  The  process 
of  data  normalization  is  the  application  of  a number  of  rules 
to  the  relational  model,  which  is  used  to  describe  all 
entities,  in  order  to  unify  the  attribute  relations  in  the 
entity  representation.  These  rules  prove  to  be  useful 
guidelines  because  the  relations  formed  by  the  normalization 
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raw  database 


normalize  so  that  all 
the  attributes  are  atomic 


identify  erroneous  data  in 
the  raw  database 


correct  or  remove  erroneous 
data  in  the  raw  database 


group  and  categorize  the  data 
into  a well-defined  hierarchical 
structure 


hierarchical  category  files 


Figure  3.1  Design  Concept  for  Data  Integration  System 
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process  will  make  data  easier  to  interpret,  verify  and 
manipulate.  In  the  case  of  printed-circuit  board  (PCB)  test 
data,  we  may  have  the  entity  which  contains  multiple  test 
results  (attributes  "defect_id"  and  "location")  in  a single 
data  entity.  In  this  case,  our  normalization  rule  should 
ensure  that  all  the  attributes  are  atomic  (in  the  smallest 
possible  component)  ; that  is,  there  is  only  one  value  for  each 
domain  and  not  a set  of  values,  as  shown  in  the  following 
example : 

Example  3 . 1 

Before  normalization,  the  entity  representation  contains 
multiple  test  results, 

(pcb_id,  workorder,  op_no,  defect_idi;  location,,  defect_id2, 
location2,  defect_id3,  location3,  date)  . 

After  normalization,  the  entity  is  described  by 


(pcb_id, 

workorder, 

op_no , 

defect_ 

id,, 

id2, 

location, , 
location2, 

date) 

(pcb_id, 

workorder, 

op_no , 

defect_ 

date) 

(pcb_id, 

workorder, 

op_no , 

defect_ 

id3, 

location3 , 

date) . 

3.2.2  Data  Verification 

The  data  verification  stage  is  designed  to  detect  and 
identify  erroneous  data  entities  in  the  raw  database,  and  to 
notify  the  next  stage  of  data  validation  to  correct  these 
entities.  This  stage  consists  of  two  types  of  verification 
operations : 

(1)  attribute-based  verification 

(2)  constraint-based  verification. 
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In  performing  data  verification,  we  utilize  two  types  of 
reference  databases  in  the  knowledge  base;  i.e.  entity/ 
attribute  dictionary  for  attribute-based  verification  and 
value  dictionary  for  constraint-based  verification,  which 
contain  the  knowledge  necessary  for  data  verification. 
Attribute-based  verification  performs  two  levels  of 
verification  for  each  attribute  of  an  entity,  i.e.  the 
syntactic  level  for  data  format,  and  the  semantic  level  for 
data  value,  code,  unit,  and  scale,  based  on  the 
entity/attribute  dictionary  which  describes  the  allowable 
formats,  values,  codes,  and  scales  for  each  attribute  of  an 
entity.  Constraint-based  verification  performs  the  product- 
dependent  dynamic  verification  based  on  the  value  dictionary 
which  describes  the  relationship  between  primary  attributes 
and  constrained  secondary  attribute  values. 

In  performing  the  attribute-based  verification,  we 
utilize  the  entity/attribute  dictionary  database  which 
specifies  allowable  domains  for  each  attribute.  This  allows 
the  information  providers,  such  as  test  operators  or 
computerized  machines,  to  cooperate  with  others  even  though 
they  use  different  formats,  scales,  or  units. 

The  entity/attribute  dictionary  is  defined  as 

Ej  ->  {An(F,C,V,U,S) , 

F — { f ^ , f j , f 3 , • . • } 


. . ,Ajk(F,C,V,U,S)  , . . . , AjN  ( F , C , V,  U , S ) } 
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C — { <-'i  / Cj , C3 , . . • } 

V = {lower  limit,  upper  limit) 

U = {u1  (ranged  ,u2(range2)  ,...  } 

S = {s,  (range,)  ,u2(range2)  , . . . ) 

where  E;  , Ajk  and  N denote  the  entity,  the  kth  attribute  of 
entity  E,- , and  the  number  of  attributes,  respectively;  F,  C, 
V,  U,  and  S denote  sets  of  allowable  formats  ( f n , f 2 , . . ) , codes 
(c,,c2,  . , values  (lower  and  upper  limit),  units  (u1(u2,..)( 

and  scales  (s1#s2,..). 

In  performing  the  semantic  level  of  attribute-based 
verification,  if  the  attribute  has  the  unit  field  (or  scale 
field) , several  allowable  units  (or  scales)  may  be  defined 
within  their  ranges.  For  instance,  an  attribute  may  have 
several  allowable  units,  such  as  "cm",  "m",and  "km".  The 
system  reads  an  attribute  value  from  the  raw  database  and 
determines  its  unit  by  finding  the  range  corresponding  to  that 
value.  The  unit  "cm"  is  the  desired  unit  which  has  the  first 
priority  for  the  range  matching.  If  the  proper  unit  is  found, 
the  procedure  will  be  stopped  with  the  unit  identification 
code  for  the  unit/scale  unification  in  the  data  validation 
stage.  If  not,  this  data  entity  will  be  identified  as 
erroneous . 

Let  us  consider  the  example  of  the  entity  'pcbtest'  for 
typical  PCB  test  data,  as  shown  in  Figure  3.2,  which  will  be 
used  for  every  illustration  in  this  dissertation.  The  first 
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three  attributes  are  entered  by  a bar  code  reader  and  the  last 
three  attributes  are  entered  by  a test  technician.  The  PCB 
inspection  results  are  fed  into  a central  database  via  a 
multiplexer  channel  as  illustrated  Figure  1.3.  The  data  entity 
(053627  4U316  601  171  C13TR14  050589)  in  Figure  3.2  can  be 
interpreted  by  the  system  as  follows: 

The  PCB  (053627  : serial  no.)  for  amplifier  (4U316  : 
product  type)  has  the  defect  (171  : solder  bridge)  between  the 
capacitor  (C13)  and  the  transistor  (TR14) , which  is  detected 
at  the  workcell  (601  : solder  inspection)  on  May  5,  1989. 

Figure  3.3  illustrates  the  structure  of  entity/attribute 
dictionary  for  PCB  test  data  which  allows,  in  this  specific 
example,  different  formats  and  codes.  The  verification 
procedure  is  applied  to  the  attributes  of  each  entity  in  order 
to  identify  erroneous  data  in  terms  of  format  and  code,  by 
performing  the  comparison  operation.  The  underlined  format  has 
the  first  priority  for  verification  (or  comparison)  as  the 
desired  format. 

In  addition  to  the  attribute-based  verification,  a 
product-dependent,  process-dependent  constraint-based 
verification  will  be  performed  using  the  constrained  attribute 
relationship.  This  verification  is  based  on  the  value 
dictionary  describing  the  constraints  between  attributes, 
which  are  dynamically  changeable  upon  the  inquiry  from  the 
manufacturing  department. 

The  value  dictionary  is  defined  as 
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entity  : pcbtest 


1 

pcb_id 

workorder 

op_no 

def ect_id 

location 

date 

053627 

4U316 

601 

171 

C13TR14 

050589 

attribute  (description) 

format 

pcb_id  ( PCB  serial  no.)  16 
workorder  (workorder  no.  for  each  product  type)  A5 
op_no  (ID  of  process  step  or  workcell  no.)  13 
defect_id  (type  of  defect)  A3 
location  (location  of  defect)  A7 
date  (data  collection  date)  16 


An:  alphanumeric  character  (n  : length) 

In:  integer  Fn:  fixed-point  (decimal  number) 


Figure  3.2  An  Example  of  the  Entity  for  PCB  Test  Data 
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entity  : pcbtest 
attribute ( 1) 
attribute (2) 
attribute ( 3 ) 
attribute ( 4 ) 


: pcb_id 

{ format (16 , A6) } 

: workorder 

(format(A5),  code(4U316, 
: op_no 

( format (13 . A3)  , code(020 
: defect_id 

(format (A3),  code (050,.. 


. . .)  } 

, • . ,902)  ) 
, F00 ) } 


Figure  3 . 3 Entity/Attribute  Dictionary  for  PCB  Test  Data 
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( E ; ) { P i / • • • / Pj/  •••/  P^  } 

— > { A1  ( Aj  , Aj  r ...  , A^ ) , . . . f Aj(  A1 , . . . jAj^/Aj^,  . . , A^ ) , . . . , 

Ajj  ( A^ , Aj , , A^ . ^ ) } 

where  PA ( E ? ) denotes  a set  of  primary  attributes  (p1fp2,...) 
for  an  entity  E. ; A;  is  the  primary  attribute  which  constrains 
the  values  and  codes  of  the  secondary  attributes  (A1 , . . , A-. , 
Aj+1 , . . , AN)  . From  this  general  definition,  we  can  easily  find 
that  each  attribute  can  be  both  a primary  attribute  and  a 
secondary  attribute  for  other  attributes. 

For  instance,  a primary  attribute  'workorder'  may  have 
the  allowable  values  of  secondary  attributes  such  as  'pcb_id', 

' op_no ' , and  'date'  in  the  entity  of  test  data  'pcbtest 
(pcb_id,  workorder,  op_no,  defect_id,  location,  date) ' . 
Furthermore,  a primary  attribute  'op_no'  may  have  the 
allowable  values  of  secondary  attribute  'defect_id'  as  shown 
in  Figure  3.4.  In  this  example,  the  data  integration  system 
searches  the  entity  name  ('pcbtest'),  primary  attributes 
( 'workorder ',' op_no ')  , and  their  secondary  attributes  as 
illustrated  in  Figure  3.5.  By  performing  this  constraint-based 
verification,  we  can  examine  and  remove  the  product-dependent 
semantic  level  of  error.  Also  the  value  dictionary  can  be 
updated  and  modified  by  a system  operator  via  the  interactive 
user  interface,  which  should  help  the  consistency  and  accuracy 
for  the  dictionary. 
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entity  : pcbtest 


primary  attribute  : workorder  (4U316  : PCB  for  amplifier) 
secondary  attributes  : 

pcb_id  ( 400000,  401000  : lower  and  upper  limit  for  PCB 
serial  number  for  product  4U316) 
op_no  { 610  : mechanical  test,  601  : solder  joint, 

637  : coating  test  } 

date  { 010588,  030588  : manufacturing  date  ) 


primary  attribute  : op_no  { 610,  601,  . ..,  637  } 

secondary  attribute  : 

defect_id  {(251,  254),  (171  : solder  bridge,  172  : 

missing  solder,  178  : cold  solder),  ..., 

(081,  . . , 084)  ) 


Figure  3 . 4 Example  for  Primary  and  Secondary  Attributes 
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pcbtest ( workorder , op_no ) 


workorder 

4U316 

op_no 

610,601,630,635,595 

590,640,637 

pcb_id 

400000  - 401000 

date 

010588  - 030588 

op_no 

610 

601 

. . . 

637 

defect_id 

251-254 

171-178 

. . . 

081-084 

Figure  3.5  Value  Dictionary  for  Constraint-Based  Verification 
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3.2.3  Data  Validation 

The  data  verification  stage  passes  erroneous  data  with 
the  error  codes  to  the  data  validation  stage.  Data  validation 
accomplishes  the  data  correction  task  in  order  to  generate  the 
error-free  raw  database,  based  on  the  verification  results  in 
the  previous  stage.  In  performing  this  task,  all  attributes 
will  be  transformed  into  the  desired  representation  by 
unifying  allowable  formats,  units,  scales,  in  order  to 
facilitate  the  following  stage  of  data  association  which 
generates  the  hierarchical  data  structure. 

The  data  validation  task  includes  the  following  subtasks: 
Format  modification 

This  subtask  modifies,  if  any,  the  alternative  formats 
into  the  desired  format.  The  system  prompts  the  data  re-entry 
for  the  attribute,  which  has  the  invalid  format,  in  the  user 
interactive  mode. 

Unit/scale  unification 

The  previous  stage  of  data  verification  can  determine  the 
unit  (or  scale)  of  an  attribute  by  the  range  corresponding  its 
value.  The  system  unifies  the  unit/scale  by  multiplying  an 
attribute  value  by  an  appropriate  factor  for  the  desired 
unit/scale. 

Invalid  code  Cor  value)  elimination 

If  invalid  codes  (or  values)  are  detected,  the  system  has 
to  decide  whether  the  corresponding  entity  will  be  eliminated 
or  the  inquiry  for  data  re-entry  will  be  issued.  The  decision 
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is  dependent  on  how  important  that  entity  is.  In  our 
application,  erroneous  test  data  entities  are  kept  from 
further  processing  by  setting  an  error  flag. 

Statistics  for  erroneous  data 

In  addition  to  the  above  generic  functions  of  data 
verification,  the  system  generates  statistical  information  for 
erroneous  data  which  can  be  utilized  to  reduce  errors  by 
informing  the  operators  of  their  most  frequent  mistakes  in 
data  entry. 

3.2.4  Data  Association 

The  error-free  raw  database,  which  has  been  verified  and 
validated  through  the  previous  operations,  is  automatically 
grouped  and  categorized  to  form  a well-structured  hierarchical 
database.  In  the  data  association  stage,  we  represent  the 
database  by  hierarchical  entity-category-relation  model 
(entity-category-attribute-relation  structure)  which  is 
referred  to  as  category  files.  The  raw  database  collected  in 
a random  fashion  is  semantically  categorized  into  an 
associative  tree  by  searching  the  data  entity,  identifying  the 
key  attribute  for  classification,  and  assigning  each  entity  to 
the  proper  node  in  the  associative  tree  structure.  Each  node 
of  the  category  file  hierarchy  represents  an  entity  associated 
with  a set  of  attributes  and  values,  as  illustrated  in  Figure 
3.6. 

The  categorization  procedure  is  defined  as  follows: 
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entity  type 


test  workcell 


defect  domain 


defect  tvaa 


QA 

database 


coating  test 


component  test 


solder  test 


in-circuit  test  — 


mechanical  test 


cleanliness  test 


installation 


— wrong  location 

— missing  component 

— wrong  polarity 
part  not  seated 

— improperly  mounted 

— short  leads 

— long  leads 

— lead  improperly  bent 


wrong  component 


type 

value 

material 


damaged 


solder 


— chipped  component 

— cracked  component 
broken  component 

— measles  component 

— damaged  PCB 

— solder  bridges 

— missing  solder 

— tract,  solder  joint 
excess  solder 

— insufficient  solder 

— pinholes 

— solder  bubbles 

— cold  solder 


tolerance 


shorted 

open 

intermittent 

voltage 

frequency 


electric  defect 


— resistance 

— capacitance 

— voltage  gain 

diode  voltage  level 

— rise/fall  time 

— truth  table 

— leakage 


I — wrong  hardware 

hardware  — missing  hardware 

— damaged  hardware 
' — loose  hardware 


Figure  3 . 6 


Category  File  Structure  for  Various  PCB  Defects 
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FOR  i=l  TO  M ; for  each  entity 

FOR  j=l  TO  L ; for  each  level  of  hierarchy 

E 1 = C - j (Ej , K.j  / Bj j)  ; generate  the  categorized  entity  E * f 
END  j ; 

END  i? 


where  K; 

-> 

{Kj, 

/ K • 2 / • « 

r i 1 / 2 , . . . , M 

Bi 

-> 

r\j 

■ • / BiL } 

Ci 

-> 

(Cji 

' Ci2'  ’ 1 

'•'CjL) 

-> 

<Ci1 

(Ej,Kji 

/ Bii ) / Cf2  (Ej , Kj2,  Bj2)  , . . . / CjL  (Ej , KjL  , BjL)  } 

L : the  number  of  levels  of  the  hierarchy  for  entity  E; 

M : the  number  of  entities  involved  in  the  operation 
K;  : the  set  of  key  attributes  for  E( 

K j j : key  attribute  for  the  jth  level  of  hierarchy 

Bj  : the  set  of  possible  branches  (categories)  for  Es 

Bj  j : the  set  of  possible  branches  for  the  j h level  of 

hierarchy 

Cj  : the  set  of  categorization  operators  for  E. 

Cjj : categorization  operator  for  the  jth  level  of  hierarchy 

In  a PCB  manufacturing  operation,  our  approach  to  test 
data  grouping  is  based  on  the  test  workcell  structure.  The 
categorization  of  various  tests  is  based  on  the  defect  type. 
The  raw  database  is  reorganized  by  the  values  of  key 
attributes  which  are  examined  to  locate  each  entity  in  an 
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associative  tree,  i.e.  workorder,  op_no,  and  defect_id.  For 
the  data  association,  we  use  the  entity-category-relation 
model  (Tou  and  Depree  [1978],  Navathe,  Elmasri,  and  Larson 
[1986])  where  category  is  defined  as  a subset  of  entity.  This 
model  is  known  to  have  a more  tree-like  structure  than  the 
well-known  entity-relationship  model  (Chen  [1976]).  In  Figure 
3.6,  the  node  'test  data',  as  a category  of  the  QA  (Quality 
Assurance)  database,  has  several  attributes  corresponding 
various  test  workcells.  In  turn,  each  node  for  a test  workcell 
may  have  its  subsets  by  categorizing  the  function  of  a test 
workcell  into  several  defect  domains  (or  categories) . Each 
category  for  defect  domain  also  maintains  many  defect  types  as 
its  attributes. 

After  grouping  and  categorizing  data  entities  based  on 
the  identity  (or  similarity)  of  the  key  attributes,  we  can 
associate  further  the  interrelated  entities  in  the  leaf  nodes 
of  the  associative  tree  into  a more  structural  shape.  In 
performing  data  association,  we  expect  to  encounter  different 
entities  in  similar  domains  which  can  be  merged  into  a single 
entity  (or  into  a hierarchical  form  relating  those  entities  by 
the  tree-like  structure) . Three  types  of  similar  domains; 
i.e.,  identical,  enclosed,  and  overlapped,  are  considered  for 
the  data  merging  as  follows; 

Identical  entities 
operation  ; A(Ej)  * A(Ef) 

A ( E j ) is  a set  of  attributes  of  entity  E; . The  merging 
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operation  for  the  identical  entities,  i.e.,  Ef  and  Ej , which 
have  the  same  attributes,  is  to  eliminate  the  duplicated 
declaration  for  entities  in  the  same  domain  by  unifying  the 
entities . 

Enclosed  entities 

operation  : A (E  ' f ) = A(Ef)  - A(Ej) 

The  enclosed  entities,  where  the  entity  E-  in  a larger 
domain  encloses  another  entity  E; , can  be  combined  to  form  a 
hierarchical  representation  by  assigning  the  parent  node  to  E- 
and  assigning  the  son  node  to  the  entity  E^  generated  by  the 
above  operation. 

Overlapped  entities 

operation  : A(Ek)  s a ( E j ) n A(Ej ) 

A ( E 1 1 = A (Ej ) - A(Ek) 

A(E  • j)  = A(Ej)  - A(Ek) 

The  overlapped  entities,  i.e.,  E;  and  E.,  where  these  two 
entities  have  a set  of  common  attributes,  A(Ek)  , can  be  merged 
by  assigning  the  parent  node  to  new  entity,  Ek,  containing  the 
overlapped  (common)  attributes  and  assigning  son  nodes  to  new 
entities,  i.e.,  E ' ■ and  E'^ , containing  the  non-overlapped 
attributes . 

3.3  Feature  Selection  and  Extraction 

In  a distributed  process  environment,  diverse 
observations  (manifestations,  symptoms,  or  measurements)  from 
various  subsystems  (or  subprocesses)  are  collected  in  the 
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process  control  system  for  automation  information  integration. 
In  this  research  work,  our  main  concern  is  to  effectively 
utilize  those  manifestations  (or  observations)  for  finding  the 
problems  (disorders)  currently  existing  in  the  shopfloor, 
i.e.,  manifestation-based  diagnostic  problem  solving.  By 
performing  the  data  integration  techniques  on  the  observed 
manifestations  discussed  in  the  previous  sections,  the  error- 
free  category  files  are  generated.  In  fact,  this  category 
files  are  constructed  on  the  basis  of  the  pre-determined 
classification  scheme  in  the  knowledge  base.  For  instance,  the 
category  file  for  the  PCB  test  data  is  constructed  on  the 
basis  of  the  test  workcell  structure  and  the  defect  domain. 
Thus  the  structure  of  the  category  file  may  not  be  appropriate 
for  the  purpose  of  diagnostic  reasoning. 

The  feature  selection  and  extraction  are  the  data 
conversion  operations  which  convert  the  well-structured 
features  (or  manifestations)  in  the  category  file  into  the 
scheme  adequate  to  our  model-based  causal  reasoning  in  the 
multi-level  diagnostic  mechanism.  These  operations  are 
different  from  the  feature  selection  and  extraction  in  the 
classical  pattern  recognition  problem.  In  fact,  these  involve 
various  data  manipulation  operations  such  as  data  sorting  and 
data  classification.  The  feature  selection  task  is  to  choose 
the  appropriate  manifestations  from  the  category  file  which  is 
generated  by  the  data  association  operation.  This  task 
searches  the  leaf  node  of  the  category  file  to  find  all  the 
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observed  manifestations  which  correspond  to  a certain  disorder 
domain.  The  feature  extraction  is  the  task  to  further  classify 
features  (or  manifestations)  into  subcategories  so  that  each 
subcategory  corresponds  to  the  local  causal  model  in  the 
categorical  causal  model  discussed  in  Chapter  IV.  Combined 
with  the  concept  of  the  categorical  causal  model  which 
consists  of  many  interrelated  local  causal  model,  the  feature 
selection  and  extraction  are  very  important  tasks  to  assign 
all  the  observed  manifestations  into  the  corresponding  causal 
model.  Conseguently , the  categorized  manifestations  contribute 
to  reduce  the  interaction  between  different  local  causal 
models  and  to  accelerate  the  diagnostic  reasoning. 

3.3.1  Feature  Selection 

Feature  selection  is  to  choose  the  set  of  manifestations 
relevant  to  a given  domain.  Since  the  category  file  consists 
of  diverse  measurements  in  a distributed  process  environment, 
the  feature  selection  procedure  is  required  to  select  all  the 
observed  manifestations  which  are  the  relevant  features  of  a 
set  of  disorders  in  certain  problem  domains.  For  instance, 
when  the  diagnostic  system  tries  to  find  the  defect  causes  in 
a PCB  manufacturing  operation,  the  feature  selection  chooses 
all  the  test  data  which  specify  PCB  defects  observed  on  the 
shopfloor . 

The  feature  selection  task  for  ith  category  can  be 


defined  as  follows: 
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FS  ( i ) = { m*  | m*  e M* , m*  e M, } 

where  { > is  a set  of  the  selected  manifestations  for  the  ith 
category  (in  the  categorical  causal  model) . M*  is  all  the 
observed  manifestations  in  the  category  file  and  is  a set 
of  all  the  possible  manifestations  in  the  ith  category.  In 
other  words,  the  feature  selection  is  to  choose  the  set  of 
manifestations,  m£,  relevant  to  the  given  category,  i,  among 
all  the  observed  manifestations,  M*. 

3.3.2  Feature  Extraction 

Feature  extraction  is  the  process  of  further  classifying 
the  selected  features  (or  manifestations)  into  smaller  subsets 
of  manifestations  which  correspond  to  subcategories.  Although 
the  number  of  features  is  very  large  in  most  complex  systems, 
such  as  manufacturing  operation  and  medicine,  the  number  of 
significant  features  for  a particular  class  is  relatively 
small.  For  instance,  Patrick  and  Fattu  states: 

The  number  of  features  for  a total  medical  classification 
system  can  be  quite  large,  perhaps  20,000.  The  number  of 
primitives  used  in  an  Internal  Medicine  system  contains 
over  4000  binary  features.  Yet  a category  such  as 
appendicitis  is  described  by  a relatively  few  number  of 
features,  perhaps  10  to  15  or  even  fewer  (Patrick  and 
Fattu  [1986],  pp.  18). 

In  fact,  we  cannot  expect  that  all  problems  are  clearly 
classified  into  exclusive  and  exhaustive  subcategories. 
However,  in  most  problem  domains  (especially  in  distributed 
process  environments)  , we  are  able  to  select  a set  of 
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subcategories  so  that  each  subcategory  has  no  interaction  (or 
least  interaction)  with  the  others  in  terms  of  the  number  of 
manifestations  overlapped  (shared) . In  the  proposed  system, 
the  criterion  for  performing  the  best  categorization  is  to 
minimize  the  number  of  the  overlapped  manifestations  which  are 
involved  in  multiple  subcategories.  Suppose  we  have  a set  of 
selected  manifestations,  m£  = {m.,,  m2,  m3,  m4,  m5,  m6).  Two  local 
causal  models  in  the  categorical  causal  model  are  involved  in 
the  following  manifestations: 

M1  = { , m4 , m5 , m^ ) 

Mj  - { m2 , m3 , m^ } 

In  this  example,  we  assign  (or  extract)  a set  of 
manifestations  (mlf  m4,  m5,  m6)  to  the  first  causal  model  and 
M2  = {m2,  m3,  nig}  to  the  second  causal  model.  We  have  the 
overlapped  manifestation,  m5,  which  is  redundant  to  this 
categorical  causal  model  approach.  However,  unless  this 
redundance  is  severe,  i.e.,  the  number  of  overlapped 
manifestations  is  very  large,  the  feature  extraction  task  can 
reduce  the  computation  complexity  involved  in  the  diagnostic 
problem  solving  in  coordination  with  the  categorical  causal 
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CHAPTER  IV 

MULTI-LEVEL  DIAGNOSTIC  INFERENCE  MECHANISM 
4 . 1 Introduction 

There  is  a strong  need  for  effectively  reducing  the 
computational  complexity  involved  in  general  diagnostic 
problems  by  limiting  the  number  of  hypotheses  (or  hypothesized 
disorders)  considered.  To  accomplish  this  goal,  we  develop  a 
new  diagnostic  mechanism  to  systematically  shrink  the  initial 
hypotheses  space  into  a small  fraction  of  the  disorder 
candidates  and  to  rank  these  candidates  by  integrating  both 
symbolic  causal  knowledge  and  numeric  measures  of  uncertainty. 
As  discussed  in  Chapter  II,  we  develop  the  theoretic  framework 
for  general  diagnostic  problem  solving  by  drawing  on  the 
experience  from  the  PADIKS  approach  for  generating  a content- 
associative  knowledge  base,  by  utilizing  the  benefits  from  the 
parsimonious  covering  theory  for  a multiple  disorders  problem, 
and  by  combining  these  strategies  with  several  salient 
features  such  as  the  layered  structure  of  the  diagnostic 
reasoning,  the  categorical  causal  model,  the  concept  of 
partitioning  feature  space,  the  cardinality  constraint,  and 
the  multimembership  classification  method.  These  features  are 
discussed  in  the  following  sections. 

In  order  to  show  some  difficulties  in  directly  applying 
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numeric  (or  probabilistic)  measures  to  all  the  possible 
disorder  candidates  in  a multiple  disorders  problem  and  to 
illustrate  some  problems  in  using  the  causal  model  for 
shrinking  the  hypotheses  space,  we  have  the  following  example. 

Example  4 . 1 

Consider  a set  of  defects  (or  manifestations)  and  a set 
of  defect  causes  (or  disorders)  for  the  wave  soldering 
subprocess  in  PCB  manufacturing  operation  discussed  in  Chapter 
III,  where  (d,.)  is  a set  of  possible  disorders  and  (m. } is  a 
set  of  possible  manifestations.  In  this  example,  we  consider 
16  disorders  and  11  manifestations  involved  in  the  subcategory 
of  wave  solder  defects. 


( dj } — { d^ , dj , dj , d4 , dj , d^ , d^ , dg , d^ , d^ , d^  , d^ , j , d14 , d^ , d^ } 


d1  : solder  temperature  high 
d3  : immersion  depth  high 
d5  : solder  wave  uneven 
d7  : preheat  temperature  high 
d9  : conveyor  speed  high 
dn:  conveyor  vibration 
d13:  flux  blow-off  excessive 
d15:  fluxer  uneven 


d2  : solder  temperature  low 
d4  : immersion  depth  low 
d6  : excessive  solder  dross 
d8  : preheat  temperature  low 
d10:  conveyor  speed  low 
d12:  early  removal  of  board 
dK:  board  not  seated  right 
d16:  pallet  too  hot 


{ HI  • } { mi  f HU  r f ^4  t ^5  t ^5  / ^7 1 ^3  t ^9  / ^ q 1 1 ) 


m1  : excess  solder 
m3  : no  solder 
m5  : icicles 
m7  : pin  holes 
m,  : disturbed 
m^:  flux  entrapment 


m2  : insufficient  solder 
m4  : bridging 
m6  : poor  wetting 
m8  : contaminated 
m10:  grainy  solder 
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Suppose  that  the  causal  relations  between  disorders  and 
manifestations  are  known  as  follows,  where  rfj-  is  the  causal 
relation  between  the  ith  disorder,  d, , and  the  jth 
manifestation,  . That  is,  if  r^  = 1,  df  may  cause  iru . And  if 
rSj.  = 0,  d.  never  causes  m^ . 
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In  Example  4.1,  when  it  is  assumed  that  all  the  possible 
disorders  are  mutually  exclusive  (like  the  PADIKS  approach), 
only  16  disorder  candidates  are  possible.  However,  the  number 
of  the  possible  disorder  candidates  is  216  when  the  multiple 
disorders  problem  is  considered.  Thus,  as  the  number  of 
possible  disorders  increases,  the  computational  complexity 
involved  in  a multiple  disorders  problem  is  exponentially 
increased.  Though  the  concept  of  hierarchical  representation 
in  the  PADIKS  approach  is  introduced  to  focus  on  a small 
fraction  of  the  initial  hypotheses  space,  it  is 
computationally  intractable  to  apply  Bayes'  theorem  directly 
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to  2 16  hypotheses.  Thus,  the  causal  relations,  [ r 1- j ] , should  be 
utilized  to  find  only  a set  of  the  disorder  candidates  which 
can  explain  all  the  observed  manifestations. 

Suppose  that  a set  of  manifestations,  QM  = (m,,  m8,  m^, 
m10},  is  observed.  By  finding  the  minimum  subset  of  the 
disorder  candidates  explaining  QM  on  the  basis  of  the  causal 
relations,  we  can  shrink  the  initial  hypotheses  space  into  a 
small  subset,  i.e.,  {(d,,d2),  (d,,d5),  (d,,d9),  (d,,du),  (d10,d2)  , 

( ^io ' ^5 ) ' ( ^io ' ' ( ^9 ' ' ( ^9 '^11^  ' ( ^9 ' ^12 ^ J ’ However , 

since  the  causal  model-based  approach  relies  on  only  the 
binary  causal  relations,  [r,--],  we  may  have  non-plausible 
disorder  candidates  by  ignoring  the  relationships  between 
multiple  simultaneous  disorders  and  may  have  disorder 
candidates  which  have  unreasonably  large  cardinalities  (number 
of  multiple  simultaneous  disorders) . For  instance,  the 
disorder  candidate,  (d^dj),  hypothesized  above  is  non- 
plausible,  because  d1 , solder  temperature  high,  cannot  be 
present  simultaneously  with  d2,  solder  temperature  low.  Thus, 
to  compensate  this  model-based  approach,  another  reasoning 
process  should  be  designed  to  limit  the  cardinality  of 
multiple  disorders  and  to  eliminate  non-plausible  disorder 
candidates  before  applying  the  probabilistic  measure  to  the 
hypothesized  disorder  candidates  for  the  differential 
diagnosis . 

Thus,  the  primary  steps  of  the  proposed  diagnostic 
mechanism  are:  (1)  Combined  with  the  categorical  knowledge 
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representation  scheme,  the  causal  relations,  [r^-],  are 
utilized  to  limit  the  number  of  hypotheses  considered  (model- 
based  candidate  generator) , (2)  Several  constraints  are 
designed  to  eliminate  non-plausible  disorder  candidates  and  to 
further  reduce  the  number  of  hypotheses  considered 
(constraint-based  filter),  and  (3)  By  removing  unreasonable 
assumptions  such  as  the  mutual  exclusiveness  of  possible 
disorders,  a new  version  of  the  pattern-directed  approach  is 
employed  to  rank  a set  of  hypotheses  considered  (pattern- 
directed  ranker) . The  details  of  the  proposed  approach  are 
discussed  in  the  following  sections. 

4.2  Design  Principles 

A new  diagnostic  reasoning  mechanism  on  the  basis  of  the 
categorized  manifestations  is  proposed  to  effectively  combine 
the  categorical  causal  model  with  the  pattern-directed 
approach  and  to  employ  some  domain  independent  heuristics  for 
making  this  integrated  model  applicable  (computationally 
tractable)  for  more  general  diagnostic  problems.  In  fact,  we 
focus  our  attention  on  improving  the  diagnostic  capabilities 
of  the  PADIKS  approach,  developed  at  the  Center  for 
Information  Research  (CIR) , by  dramatically  modifying  its 
diagnostic  inference  structure,  by  systematically  shrinking 
the  initial  hypotheses  space  to  reduce  the  computational 
complexity  of  a multiple  disorders  problem,  and  by  adding  a 
coherent  and  theoretically  justifiable  means  to  rank 
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hypothesized  disorder  candidates. 

In  designing  a new  inference  mechanism  for  general 
diagnosis  problem  solving,  we  aim  at  accomplishing  the 
following  goals: 

(1)  To  effectively  solve  a multiple  disorders  problem  where 
manifestations  may  be  due  to  several  simultaneous 
disorders  (multimembership  classification  problem) ; 

(2)  To  develop  a systematic  way  to  shrink  the  hypotheses 
space  on  the  basis  of  the  symbolic  causal  model 
(categorical  causal  model) ; 

(3)  To  make  use  of  the  classif  icatory  property  of  the 
application  domain  on  the  basis  of  the  entity-category- 
relation  model  (content-associative  category  file) ; 

(4)  To  design  novel  diagnostic  heuristics  preventing  the 
combinatorial  explosion  due  to  the  elimination  of 
hypotheses  independence  assumption,  which  make  a new 
inference  mechanism  applicable  for  more  general 
diagnostic  problems  (constraint-based  heuristics  for 
ambiguity  resolution) ; 

(5)  To  modify  the  PADIKS  approach  to  rank  the  hypothesized 
disorder  candidates  by  removing  the  unreasonable 
assumptions  such  as  manifestation  independence  for  a more 
general  diagnostic  problem  solving  (pattern-directed 
ranker) . 

To  achieve  the  above  goals,  the  multi-level  diagnostic 
mechanism  is  proposed.  As  shown  in  Figure  4.1,  this  layered 


80 


architecture  consists  of  three  different  levels  of  diagnostic 
reasoning  operations,  namely,  (1)  the  model-based  candidate 
generator  for  a multiple  disorders  problem  on  the  basis  of  the 
categorical  causal  model,  which  dramatically  shrinks  the 
initial  hypotheses  space  into  a small  fraction  of  the 
hypothesized  candidate  space  relevant  to  the  observed 
manifestations;  (2)  the  constraint-based  filter  for  further 
shrinking  the  hypothesized  candidates  by  applying  the 
generalized  diagnostic  heuristics  which  utilize  various 
constraints  on  disorders  in  hypothesized  candidates  and 
resolve  some  domain-dependent  ambiguities;  (3)  the  pattern- 
directed  ranker  for  differentiating  the  hypothesized 
candidates  to  recommend  the  most  plausible  solutions  to  the 
operators  by  using  the  modified  version  of  the  PADIKS 
approach. 

Combined  with  its  categorical  causal  model,  this  layered 
scheme  has  a great  flexibility  for  easily  modifying  and 
updating  its  reasoning  mechanism  without  much  difficulties.  In 
fact,  any  modification  in  one  layer  does  not  affect  other 
layers  (upper  layers  or  lower  layers)  as  long  as  the  interface 
protocols  are  maintained,  which  is  the  well-known  benefit  of 
a layered  system  structure. 

This  multi-level  diagnostic  reasoning  mechanism,  which 
consists  of  the  model-based  candidate  generator,  constraint- 
based  filter,  and  pattern-directed  ranker,  provides  a highly 
efficient  diagnostic  procedure  and  performs  an  effective 
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reduction  of 

the  hypotheses  space.  Figure 

4.2 

shows  how 

the 

multi-level 

diagnostic 

reasoning  mechanism 

reduces 

the 

hypotheses 

space  and 

differentiates 

(or 

ranks) 

the 

hypothesized  disorder  candidates. 

Hypotheses  Space  Reduction 

In  a diagnostic  system  for  fault  diagnosis  and 
supervisory  control  in  a distributed  process  environment,  the 
computational  complexity  is  one  of  the  most  important 
technical  issues  because  of  its  requirement  of  time  critical 
solutions.  Especially  when  multiple  simultaneous  disorders  can 
be  present  and  the  number  of  possible  disorders  and 
manifestations  is  very  large,  it  is  almost  impossible  for  the 
conventional  Bayesian  approach  to  expect  timely  solutions  for 
a general  diagnostic  problem. 

To  reduce  the  computational  complexity,  we  make  use  of 
the  classificatory  property  of  our  application  domain  by 
defining  the  hierarchical  knowledge  structure  which  consists 
of  entities,  categories,  subcategories,  and  so  on.  Then, 
provision  is  made  for  subcategories  of  the  category  where  a 
subset  of  measurements  or  features  are  defined  for  each 
subcategory.  Since  the  number  of  symptoms  and  test  results  is 
very  large  in  our  application  domain,  the  use  of  a subcategory 
concept  helps  us  to  reduce  the  computational  complexity,  i.e., 
to  prevent  the  combinational  explosion,  which  is  also  one  of 
the  most  important  features  of  the  PADIKS  approach  (Tou 
[1985a] ) , Dai  [1985]) . 
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Figure  4.2  Multi-level  Diagnostic  Reasoning  Procedure 
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Furthermore,  we  expand  PADIKS  concept  by  employing  a new 
approach  to  systematically  reducing  the  hypotheses  space. 
First,  the  categorical  causal  model  is  combined  with  the  set 
covering  concept  in  order  to  efficiently  handle  a multiple 
disorders  problem.  By  performing  the  diagnostic  reasoning 
based  on  this  strategy,  we  can  focus  on  a set  of  the 
hypothesized  disorder  candidates,  relatively  a small  fraction 
of  the  initial  hypotheses  space,  which  explains  (or  covers) 
all  the  observed  manifestations.  Among  those  candidates,  we 
eliminate  the  candidates  violating  the  constraints  imposed  on 
the  disorder  relationships  (disorder  dependence) , the  maximum 
allowable  cardinality,  and  the  disorder-feature  relationships, 
which  are  discussed  in  the  following  sections.  After  applying 
all  the  possible  constraints  to  the  hypothesized  disorder 
candidates,  we  are  able  to  generate  a set  of  conflict-free 
disorder  candidates,  called  disorder  concepts.  Since  the  final 
goal  of  diagnostic  problem  solving  is  to  provide  a set  of  the 
most  plausible  disorder  candidates  (consisting  a single 
disorder  or  multiple  simultaneous  disorders)  with  weight 
factors  for  ranking  all  the  hypothesized  ones.  In  order  to 
perform  this  differential  diagnosis,  the  modified  version  of 
PADIKS  approach  is  applied  to  the  hypothesized  candidates, 
which  effectively  handles  a multiple  disorders  problem  and 
eliminates  the  unreasonable  assumptions  employed  in  the 
previous  PADIKS  approach. 
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Basic  Definitions  and  Notations 

Before  discussing  the  details  of  the  proposed  multi-level 
diagnostic  mechanism,  here  we  present  several  important 
definitions  and  notations  which  make  later  discussions  clear. 

First,  we  define  D = {d,,  d2,  ...,  d. , ...,  d{)  for  each 
categorized  domain,  the  set  of  all  the  possible  disorders  (or 
disorder  classes) , where  d;  is  the  ith  individual  disorder  and 
1 is  the  number  of  possible  disorders.  Accordingly,  M = (m,, 
ra2'  •••/  / •••/  mk } , the  set  of  all  the  possible 
manifestations,  is  defined  as  the  features  of  the  possible 
disorders  in  that  domain,  where  is  the  ith  individual 
manifestation  and  k is  the  number  of  possible  manifestations. 

In  a multiple  disorders  problem,  we  have  to  define  two 
different  types  of  disorder  candidate  sets.  First,  C0  = {C.,, 
C2'  •••'  cn},  the  se^  aH  possible  disorder  candidates, 
consists  of  all  the  2 1° I (or  2l)  possible  intersections  of  (dv 
d2,  ...,  dt } , where  n is  the  number  of  possible  candidates, 
i.e.,  2 1 . Cn  is  the  nth  possible  candidate  which  consists  of 
single  or  multiple  disorders  in  D.  We  can  also  define  the  set 

of  disorder  candidates,  C+  = {C},  C*,  , C*},  which  explains 

all  the  observed  manifestations.  Thus  C*  is  the  pth  disorder 
candidate  consisting  of  single  or  multiple  disorders  in  D, 
which  explains  all  the  observed  manifestations.  Obviously,  C* 
is  only  a small  subset  of  C in  most  problem  domains. 
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4.3  Model-Based  Candidate  Generator 

As  discussed  before,  the  model-based  candidate  generator 
is  designed  to  hypothesize  a set  of  disorder  candidates  which 
can  explain  all  the  observed  manifestations  on  the  basis  of 
the  causal  associations  between  disorders  and  manifestations. 
To  solve  a multiple  disorders  problem  and  to  effectively  limit 
the  number  of  hypotheses  considered  (or  disorder  candidates) , 
set  covering  theory  is  employed  to  utilize  the  causal 
associations . 

The  PADIKS  approach  is  the  combination  of  pattern 
recognition  techniques  and  the  knowledge-based  approach  with 
a content-associative  knowledge  representation  scheme.  The 
diagnostic  problem  using  classical  pattern  recognition 
techniques  is  concerned  with  the  assignment  of  a set  of 
observed  manifestations  to  one  and  only  one  (disorder)  class 
from  a given  set  of  (disorder)  classes,  d1 , d2,  ...,  dm. 
However,  in  solving  real  world  problems,  we  are  often  faced 
with  simultaneous  occurrence  of  multiple  faults.  In  complex 
systems,  for  example,  multiple  components  may  fail 
simultaneously;  on  the  shopfloor,  manufacturing  operations 
occasionally  have  more  than  one  disorder  (or  defect  cause) . To 
handle  such  situations,  the  human  diagnostician  makes  use  of 
a knowledge  base  organized  as  an  associative  network.  The 
central  component  of  this  associative  knowledge  is  causal 
associations  between  disorders  and  manifestations. 
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Candidate  generator.  A (disorder)  candidate  is  a 
particular  hypothesis  for  how  the  manifestations  are  present 
(Kleer  and  Williams  [1986]).  Ultimately,  the  goal  of  diagnosis 
is  to  identify  and  refine  the  set  of  candidates  consistent 
with  the  observed  manifestations.  A candidate  is  represented 
by  a set  of  assumptions,  namely,  a set  of  hypothesized 
disorders.  As  every  candidate  must  explain  every  observed 
manifestation,  each  set  representing  a candidate  must  have  a 
non-empty  intersection  with  every  manifestation. 

In  medicine,  a candidate  is  a set  of  diseases  (or 
disorders)  which  explains  all  the  observed  symptoms  (or 
manifestations) . Before  any  observations  have  been  taken  we 
know  nothing  about  the  problem.  The  size  of  the  initial 
candidate  space  grows  exponentially  with  the  number  of 
possible  disorders,  |D|.  In  a general  diagnostic  problem,  any 
disorder  could  be  present,  thus  the  candidate  space  for  |D|  = 
4 initially  consists  of  2k  = 16  candidates. 

It  is  essential  that  candidates  be  represented  concisely. 
Candidates  have  the  property  that  any  superset  of  candidate 
must  be  a candidate  as  well.  Thus  the  space  of  all  candidates 
consistent  with  the  observations  can  be  represented  by  the 
minimum  candidates.  The  goal  of  the  candidate  generator  is  to 
identify  the  complete  set  of  minimal  candidates  on  the  basis 
of  the  observed  manifestations.  The  space  of  candidates  can  be 
visualized  in  terms  of  a subset-superset  lattice  as  shown  in 
Figure  2.2.  The  minimal  candidates  then  define  a boundary  such 
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that  everything  from  the  boundary  up  is  a valid  candidate, 
while  everything  below  is  not. 

The  model-based  candidate  generator  is  designed  to 
hypothesize  the  set  of  minimal  disorder  candidates  based  on 
the  causal  associations  between  disorders  and  manifestations. 
Especially,  the  set  covering  principle  is  combined  with  the 
categorical  causal  model  emphasizing  the  taxonomic  property  of 
the  associative  knowledge  in  order  to  effectively  perform  the 
candidate  generation. 

4.3.1  Categorical  Causal  Model 

In  attempting  to  applying  a causal  model  to  a complex  and 
dynamic  domain,  the  conventional  approaches  to  representing  a 
causal  model  turn  out  to  be  inadequate  due  to  their  poor 
flexibility  and  expandability  in  a complex  problem  domain. 
Therefore,  we  introduce  a hierarchical  framework  for 
representing  causal  knowledge  at  the  various  level  of 
abstraction.  Such  hierarchical  representation  supports 
reasoning  at  different  levels  of  abstraction,  which  allows  the 
system  to  focus  quickly  on  the  problem  by  postponing  a more 
detailed  analysis  until  it  is  necessary. 

The  potential  applications  of  causal  models  are  limited 
due  to  the  static  nature  of  model  construction.  Once  a single 
causal  model  is  constructed  for  the  model-based  reasoning  for 
a complex  system,  the  dynamic  characteristics  of  the  causal 
model  are  not  easily  modified.  To  alleviate  this  problem,  we 
present  the  categorical  causal  model  which  consists  of  various 
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causal  models  for  different  levels  of  abstraction.  The 
categorical  causal  model  is  constructed  and  modified  on  the 
basis  of  the  observed  manifestations  with  the  aid  of  the 
process  hierarchy  and  reasoning  constraints  such  as  relations 
between  subprocesses  involved  in  a certain  task. 

Especially  in  a distributed  process  environment,  many 
subprocesses  cooperate  to  perform  a specific  manufacturing 
task.  Different  sets  of  subprocesses  may  be  involved  in 
different  tasks.  The  functional  hierarchy  and  the  structural 
hierarchy  of  a distributed  process  can  be  utilized  to  generate 
the  corresponding  categorical  causal  model. 

As  an  example,  Figure  4.3  shows  the  functional  hierarchy 
of  subprocesses  and  the  corresponding  hierarchy  of  the  causal 
model  in  a distributed  process  environment.  Depending  on  the 
operational  constraints  which  describe  subprocesses  and  their 
hierarchical  relationships  involved  in  a certain  task,  the 
categorical  causal  model  is  reconstructed  and  modified.  This 
strategy  shrinks  the  search  space  by  automatically  focusing  on 
the  possible  subspace  and  removing  the  rest  of  the  search 
space  in  the  diagnostic  reasoning  process. 

In  fact,  the  reconstruction  of  the  categorical  causal 
model  is  not  a difficult  task,  when  the  causal  model  for  each 
subprocess  is  built  on  the  knowledge  base  (knowledge  details) 
and  the  categorical  relationships  between  subprocesses  are 
clearly  defined  on  the  knowledge  base  (knowledge  sketch)  for 
each  task.  The  categorical  causal  model  provides  overall 
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Figure  4.3  Construction  of  the  Categorical  Causal  Model 

(a)  Subprocess  Configuration 

(b)  Causal  Model  Configuration 
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sketch  of  the  global  causal  association  which  consists  of 
causal  models  for  subprocesses.  Thus,  we  call  these  individual 
causal  models  "local  causal  models"  due  to  their  locality  to 
the  corresponding  subprocesses  and  call  the  categorical  causal 
model  "global  causal  model"  defining  the  categorical  relations 
between  local  causal  models.  Local  causal  models  have 
relatively  static  characteristics  even  though  their  causal 
associations  can  be  updated  and  modified  via  an  interactive 
user  interface.  However,  since  the  global  causal  model  is 
constructed  on  the  basis  of  various  subprocess  configurations, 
it  has  dynamic  characteristics. 

Definition  4.1 

A "categorical  causal  model"  (C2M)  , for  a distributed 
process  environment,  is  a set  of  interrelated  causal  models 
(CMs) . Let  C2M  be  the  categorical  causal  model  for  a certain 
task. 


C2M  = {CM} 

- { CM1 , CM1 1 , CM1  j , CM1  j , • • • , CM^  ^ , CM^  ...  } 

= { {CM,.},  { CM,-  ■ } , { CMj  jk  } , } 


where  CM  is  the  finite  set  of  local  causal  models  associated 
with  a set  of  vertices  in  the  knowledge  sketch  of  the 
diagnostic  model.  {CMf}  is  the  finite  set  of  local  causal 
models  at  the  first  level  of  the  C2M  hierarchy,  where  CM,,  is 
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the  ith  local  causal  model;  {CM^. } is  the  finite  set  of  local 
causal  models  at  the  second  level  of  the  C2M  hierarchy,  where 
CMj j is  the  jth  subcategory  causal  model  of  CM,.;  and  so  on.  In 
the  following  discussions,  we  focus  our  attention  on  the 
symbolic  reasoning  based  on  the  individual  local  causal  model. 
C Local)  causal  model 

Causal  models  have  been  widely  used  in  model-based 
reasoning  for  diagnostic  problem  solving.  A causal  model 
encodes  the  structure  and  behavior  of  the  system  it 
represents,  making  explicit  the  causal  relationships  among  the 
system  components  and  behavior  states.  Thus,  a causal  model 
provides  the  explanation  basis  of  the  system  state,  i.e.  what 
is  happening  in  the  system,  what  caused  it  to  happen  and  what 
will  happen  in  the  system.  One  of  the  most  interesting 
features  of  the  causal  model  is  that  it  contains  only  domain 
knowledge.  No  control  knowledge  to  implement  a particular  type 
of  reasoning  is  included  in  the  model,  which  makes  the 
knowledge  base  maintenance  easy  (Hudlicka  [1988]).  Since  the 
cause-and-ef feet  diagrams  are  utilized  to  build  the  causal 
model,  we  discuss  the  concept  of  these  diagrams  in  the 
following  section. 

Cause-and-ef feet  diagrams 

The  cause-and-ef feet  diagram  was  developed  by  Ishikawa 
(Ishikawa  [1976]).  It  has  been  widely  used  for  the  quality 
assurance  in  manufacturing  industries.  It  is  a top-down 
diagrammatic  technique  used  to  determine  the  cause (s)  for  a 
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given  effect.  This  technique  defines  causes  that  can  be 
monitored  to  give  an  early  warning  of  trends  which  ultimately 
reduce  productivity.  To  produce  this  diagram,  a major  arrow  is 
drawn  pointing  to  the  effect.  Minor  arrows  which  generically 
categorize  probable  causes  are  then  joined  to  the  trunk  of  the 
main  arrow.  Subsets  of  arrows  that  define  specific  causes 
related  to  the  general  cause  are  then  added.  An  abbreviated 
example  is  illustrated  in  Figure  4.4  (a)  . The  cause-and-ef feet 
diagram  can  also  be  constructed  using  the  process  flow  diagram 
as  the  main  trunk,  as  shown  in  Figure  4.4  (b)  . This  is 
appropriate  to  use  when  it  is  suspected  that  the  effect  being 
studied  can  be  produced  by  a number  of  causes  distributed 
through  many  process  steps,  such  as  a distributed  process 
environment. 

Because  it  is  a well-known  and  popular  method  in 
industries  and  its  categorical  causal  classification  is  a 
perfect  fit  for  generating  a causal  associative  model,  the 
cause-and-ef feet  diagram  is  utilized  in  building  the  causal 
knowledge  base  for  fault  diagnosis  and  supervisory  control  in 
a distributed  process  environment.  Figure  4.5  illustrates  an 
example  of  the  cause-and-ef feet  diagram  for  the  solder  defect, 
"no  solder  in  hole",  which  is  frequently  found  in  our 
application  domain,  i.e.,  a PCB  manufacturing  operation. 

By  utilizing  the  cause-and-ef feet  relation  based  on  the 
field  engineers'  experience  which  depicts  the  possible  causes 
(disorders)  for  each  defect  (manifestation) , the  causal  model 
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Figure  4.5  An  Example  of  Cause-and-Ef feet  Diagram  in  a PCB  Manufacturing  Operation 


96 


can  be  defined  as  follows: 

Definition  4.2 

The  causal  model,  CM,  for  a general  diagnostic  problem 
solving  can  be  formulated  as  a 4-tuple  <H,D,M,R>,  where: 

H = {hv  h2,  ...,  hj , ...,  hn}  is  the  finite  set  of  hypotheses 
associated  with  a set  of  vertices  in  the  knowledge  sketch 
of  the  diagnostic  problem. 

D = (d.,,  d2,  dj,  ...,  dL)  c H is  the  finite  set  of 

concepts,  called  disorders,  associated  with  the  highest 
level  of  nodes  in  the  knowledge  sketch  of  the  diagnostic 
problem. 

M = { m1 , m2,  ...,  nij , ...,  mk}  c H is  the  finite  set  of 

concepts,  called  manifestations,  associated  with  the 
lowest  level  of  nodes  in  the  knowledge  sketch  of  the 
diagnostic  problem. 

R = {r,,  r2,  ...,  r,.,  ...,  rm)  c H x h is  the  set  of  relations 
between  hypotheses  in  the  knowledge  sketch  of  the 
diagnostic  problem. 

Again,  the  causal  model  defined  in  Definition  4.2  does 
not  represent  the  overall  problem  domain,  but  defines  the 
causal  association  in  a certain  subprocess,  i.e.,  one 
subcategory  of  the  problem  domain.  For  instance,  the  causal 
model  defined  here  may  represent  the  automatic  solder 
subprocess  instead  of  whole  PCB  manufacturing  process. 
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Since  we  utilize  the  cause-and-ef f ect  diagram  for  the 
generation  of  diagnostic  knowledge,  the  following  sets  can 
naturally  be  defined  on  the  basis  of  the  causal  model: 

effects  (dj)  = {mj  <df,  m.>  e R,  d;  e D} 
causes  (ur)  = {d;|  <df , m.>  e R,  m^  e M}. 

The  construction  of  a causal  model  is  illustrated  by 
taking  the  following  example  for  the  causal  relations  between 
solder  defects  and  their  causes,  in  a PCB  manufacturing 
operation. 


Example  4 . 2 

Consider  a part  of  the  causal  model  <H,D,S,R>  for  the 
wave  soldering  process  in  PCB  manufacturing  operation,  where 


H = D u M 


D = 

{ d^ , d2 , d3 , d4 , dg , d6 , d7 

• ^8  ^ 

d1: 

solder  temperature  high 

d2:  immersion  depth  high 

d3: 

immersion  depth  low 

d4:  preheat  temperature  high 

d5: 

preheat  temperature  low 

d6:  conveyor  speed  low 

d7: 

flux  blow-off  excessive 

dg:  pallet  too  hot 

I = 

{ m^ , nij , m3 , m4 , m3 , m^ } 

m1 : 

excess  solder 

m2:  insufficient  solder 

m3: 

bridging 

m4:  icicles 

m5 : 

pin  holes 

m6:  grainy  solder 
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Based  on  the  experts'  knowledge  and  the  process  control 
record,  we  can  generate  the  causal  relations  between  disorders 
and  manifestations  as  shown  in  Table  4.1.  These  causal 
relations  can  precisely  be  represented  by  the  causal  network 
of  Figure  4.6. 

4.3.2  Set  Covering  Theory  with  Categorical  Causal  Model 

The  real  goal  of  the  diagnostic  process  is  to  generate  a 
diagnosis  which  can  explain  all  the  observed  manifestations. 
Since  the  system  can  have  multiple  faults,  the  correct 
diagnosis  answer  will  often  consist  of  a number  of  disorders, 
each  of  which  explains  some  of  the  manifestations,  but 
together  they  account  for  all  the  observed  manifestations  in 
a best  way.  In  general,  the  problem  of  picking  the  best  subset 
of  hypotheses  is  a version  of  the  "set-covering"  problem  and 
as  such  is  computationally  very  complex  (Milne  [1987]). 

In  the  set  covering  problem,  the  underlying  knowledge  is 
represented  by  an  associative  tree  structure,  namely,  the 
categorical  causal  model,  which  consists  of  many  interrelated 
local  causal  models.  As  discussed  in  the  previous  section,  in 
each  causal  model,  there  are  three  types  of  discrete  finite 
sets  which  define  the  scope  of  diagnostic  problem:  D, 
representing  all  possible  disorders  df , M,  representing  all 
possible  manifestations  nij , and  R,  representing  all  causal 
relations  between  df  and  nr . As  illustrated  in  Figure  4.7  (a), 
the  model-based  diagnostic  knowledge  is  organized  by 
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Table  4.1  The  Relations  of  Disorders  and  Manifestations  in 
Example  4.2 


di 

effects (dj ) 

d1 

m2  m5 

d2 

d3 

m2  m5 

d4 

1&2 

d5 

m3  m5 

d6 

mi  m6 

d7 

m2  m3  m4 

d8 

m1  m2  in^ 
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di  d2  d3  d4  ds  de  d7  ds 


Figure  4.6  Causal  Network  Representation  for  Example  4.2 
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associating  d-  and  iru  based  on  their  causal  relation  r^  c R. 

Let  Q = { q, , q2,  . ..,  q, , qp)  c H be  a query  vector 
that  consists  of  the  observed  hypotheses.  In  fact,  the 
observed  hypotheses  vector,  Q,  is  the  union  of  a set  of 
observed  manifestations,  QM,  and  a set  of  observed  disorders, 
Q , i.e.  Q = Qm  u Q0.  However,  we  concentrate  on  diagnostic 
problem  solving  to  find  a set  of  disorder  candidates  which  can 
explain  (or  cover)  the  observed  manifestations  on  the  basis  of 
Qm , D,  and  R' , where  QM  c M and  R'  = D x QM  c R.  Therefore  we 
can  formulate  the  diagnostic  problem  solving  based  on  the 
causal  model  and  the  observed  manifestations  as  follows: 

A model-based  diagnostic  problem,  Pd,  in  each  problem  domain 
is  to  find  a set  of  disorder  candidates  explaining  all  the 
observed  manifestations.  That  is, 

Pd(CM,  Qm)  = Pd(H,  D,  M,  R,  Qm) 

= {C*  | C*  c D,  Qm  c effects  (C*)  } 

where  CM  is  the  local  causal  model  corresponding  to  the 
observed  manifestations  QM.  C*  is  the  ith  disorder  candidate 
explaining  QM  in  a best  way.  In  other  words,  Pd  is  to  pick  the 
best  subset  of  hypotheses  which  cover  all  the  observed 
manifestations,  QM . Figure  4.7  (b)  illustrates  the  concept  of 
set  covering  problem,  where  the  best  subset  is  defined  as  the 
minimum  subset,  namely,  a set  of  the  minimum  disorder 
candidates,  C* , among  all  the  possible  disorder  candidates  ,CD. 
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(a) 


(b) 


Figure  4.7  Causal  Relations  and  the  Minimum  Covering  Set 
in  Hypotheses  Space 

(a)  Causal  Relations  between  D and  M in  the 
Hypotheses  Space 

(b)  Concept  of  the  Minimum  Covering  Set  Based 
on  M 
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In  order  to  find  the  minimum  subset  of  disorders  explaining 
(or  covering)  the  observed  manifestation,  we  introduce  the  set 
covering  theory  as  follows  (Edmonds  [1962]): 

Definition  4.3  (a) 

For  a finite  set  S of  elements  and  a family  F of  subsets 
of  S,  a cover  C of  S from  F is  a subfamily  C c F such  that 
u(C)  = S.  A cover  C is  called  minimum  if  its  cardinality  |C| 
is  as  small  as  possible. 

In  order  to  use  this  definition  in  finding  the  minimum 
subset  for  the  general  diagnostic  problem  solving,  we  have  to 
modify  the  definition  of  "cover".  Because  the  set  covering 
problem  is  to  find  the  hypotheses  which  explain  (not 
necessarily  equal  to,  but  contain)  the  observed 
manifestations,  Definition  4.3  (a)  is  revised  as  follows: 

Definition  4.3  fb) 

For  a finite  set  S of  elements  and  a family  F of  subsets 
of  S,  a cover  C of  S from  F is  a subfamily  C c F such  that  S 
c u(C) . A cover  C is  called  minimum  if  its  cardinality  |C|  is 
as  small  as  possible. 

In  this  definition,  S corresponds  to  the  observed 
manifestation,  QM , and  F corresponds  to  a set  of  disorders,  D, 
in  the  sense  that  each  d;  e D labels  a subset  of  QM  (the 
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intersection  of  effects (dj)  with  QM)  (Peng  and  Reggia  [1987c]) . 

A theory  of  diagnosis  could  be  based  on  the  minimal 
cardinality  principle.  In  real-world  diagnostic  problems, 
however,  we  can  find  some  cases  to  which  this  principle  cannot 
be  applied.  For  example,  we  may  have  the  plausible  covering 
set  for  a set  of  manifestations,  but  its  cardinality  is  not 
the  smallest  one.  Therefore,  we  define  the  modified  version  of 
minimum  covering  set  for  the  general  diagnostic  problem- 
solving : 

Definition  4.4 

Cmin  is  a minimum  covering  set  of  QM  iff  no  proper  subset 
of  Cmjn  is  a cover  of  QM , where  QM  is  a set  of  observed 
manifestations . 

However,  this  purely  causal  reasoning  based  on  Definition 
4.4  may  form  an  impractically  large  cardinality  of  the 
disorder  candidates  (the  minimum  covering  sets) . Thus,  to  make 
the  new  definition  of  the  minimum  covering  set  more  feasible, 
we  define  the  maximum  allowable  cardinality,  I C | |nax , which 
limits  the  cardinality  value  of  the  minimum  covering  set.  For 
instance,  based  on  the  history  data  in  diagnostic  control,  we 
may  heuristically  determine  the  maximum  allowable  cardinality. 
For  instance,  if  | C | max  = 2,  it  is  assumed  that  more  than  two 
disorders  cannot  occur  simultaneously,  i.e.,  two  is  the 
maximum  cardinality  of  the  candidate  sets  of  disorders.  The 
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operator  provides  the  system  with  the  maximum  cardinality 
value  to  limit  the  number  of  simultaneous  disorders  via  an 
interactive  user  interface.  Detail  strategies  of  using  Icl^ 
are  discussed  in  Section  4.4. 

Here  we  define  some  useful  concepts  for  solving  the  set 
covering  problem. 

Definition  4.5 

If  X = { x^ , x2 , • . • , xn } and  Y — { y i / Yj/  • • • # Ym}  3re 

finite  sets  and  R c X x Y is  a relation,  the  Boolean  incidence 
matrix  for  R is  the  matrix  [r,..],  where  rSj  = 1 if  <xj;  y->  e R, 
and  rfj.  = 0 otherwise,  i.e.,  <xf , y^>  f R. 

Since  a binary  relation  and  its  Boolean  incidence  matrix 
completely  determine  each  other,  we  shall  use  these  terms 
synonymously  herein;  thus  a row  of  R shall  be  a row  of  the 
Boolean  incidence  matrix  for  R,  etc.  In  fact,  minimum  covers 
are  equivalent  to  solutions  of  the  following  integer  program: 
Minimize  I x.  by  a vector  X = (x,,  x2,  ...,  xn)  1 of  zeroes  and 
ones  for  which  RX  > 1 = (1,  1,  . . . , 1)  ' . Here  R is  the  Boolean 
incidence  matrix  of  members  of  columns  versus  members  of  rows. 

Example  4 . 3 

The  causal  relations  described  in  Example  4.2  can  be 
represented  in  the  form  of  the  Boolean  incidence  matrix  as 


follows: 
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0 0 
1 0 
0 1 

R = 

0 1 
1 0 
0 0 

Definition  4.6 

Let  X and  Y be  sets  and 
c x x Y is  R-specif icity  if 
and  Y'  c Y such  that  R'  = X' 


0 0 0 1 0 1 
110  0 11 
0 0 10  10 
0 0 10  11 
10  10  0 1 
0 0 110  0 


R c X x Y be  a relation.  Then  R' 

(1)  R'  c R and  (2)  there  X'  c X 
x Y ' . 


In  the  diagnostic  reasoning,  the  Boolean  incidence  matrix 
(BIM)  based  on  the  causal  model,  CM  = <H,D,M,R>,  is  used  to 
find  the  minimum  covering  set.  In  fact,  we  make  use  of  a BIM 
for  R'  , a subset  of  R,  which  provides  the  relations  between 
the  observed  manifestations,  QM,  and  the  possible  disorders, 
D,  i.e.,  R'=  D x qm  c R.  Therefore,  the  reasoning  procedure  of 
causal  model-based  candidate  generator  can  be  summarized  as 
the  following  steps: 

(1)  To  retrieve  the  observed  manifestations,  QM,  and  the 
corresponding  causal  model,  CM,  in  the  knowledge  base. 

(2)  To  generate  the  Boolean  incidence  matrix,  R'=  D x qm  c R. 

(3)  To  find  the  disorder  candidates  (in  fact,  the  minimum 
covering  set  explaining  QM)  . 
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Example  4.4 

Let  the  observed  manifestations  be  QM  = {m,,  m5,  m6}  and 
use  the  causal  relations  between  disorders  and  manifestations 
shown  in  Table  4.1.  Initially,  QM  (the  observed 
manifestations)  , DQ  (the  possible  disorders  list  based  on  QM) , 
C*  (the  disorder  candidates)  are  all  empty.  We  can  generate  a 
set  of  diagnostic  solutions  by  following  the  procedures,  on 
the  basis  of  the  observed  manifestations  and  the  causal  model. 

Step  0 

Q„  = dq  = c+  = 

Step  1 

(1)  Get  the  manifestation  mf . 
m5  = mi 

(2)  Retrieve  causes  (m;)  from  the  causal  model,  i.e.  Table  4.1. 
causes  (m,)  = {d6,  d8) 

(3)  QM  - QM  u m. 

Qm  = (m,} 

(4)  DQ  <-  DQ  u causes (mt-) 

= ( ^6 ' ^8  ^ 

(5)  Adjust  the  minimum  cover,  Cm,  to  accommodate  mf . 

C+  = ({d6,  d8)) 

Step  2 : Repeat  Step  1 where  mf  = m5. 

(1)  m5 

(2)  causes  (m5)  = (d,,  d3,  dx,  d8) 
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(3) 

Qm  - 

{ml. 

ITlg} 

(4) 

Do  " 

{dv 

dj  , dg  , dg  , dg  } 

(5) 

C+  = 

( ( dg 

} , { dg  } X { d,  , 

Step  3 : Repeat  Step  1 where  m,.  = m6. 

(!)  m6 

(2)  causes  (m6)  = {d5,  d6) 

(3)  QM  = {ml,  m5,  m6) 

(4)  Dq  = {d1f  d3,  d5,  d6,  d8> 

( 5 ) C = ( { dg  } X { dg  , d6  } , { dg  } X { d^  , dg  , dg  } ) 

= ({d8,  dg},  {dg,  d6},  { d6,  d1 } , {d6,  d3},  {d6,  d5} 

By  repeating  the  above  steps,  we  can  get  the  solution,  a set 
of  the  hypothesized  disorder  candidates,  C+  = (C^,  Cj,  C3,  C*, 

Cj)  = ({dg,  dg},  {dg,  dg},  {dg,  d,},  {dg,  dj},  {dg,  dg  } ) , 63^ 

element  of  which  can  explains  all  the  observed  manifestations, 
Qm  = (m,,  mg,  nig).  Based  on  this  result,  the  following 
explanation  is  generated: 

If  the  manifestations,  {excess  solder,  pin  holes,  grainy 
solder},  are  observed  from  a PCB  manufacturing  operation,  the 
hypothesized  disorder  candidates  are  {pallet  too  hot  and 
preheat  temperature  low},  {pallet  too  hot  and  conveyor  speed 
low},  {conveyor  speed  low  and  solder  temperature  high}, 
{conveyor  speed  low  and  immersion  depth  low},  {conveyor  speed 
low  and  preheat  temperature  low}. 
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4.4  Constraint-Based  Filter 

The  constraint-based  filter  aims  at  reducing  the 
computational  complexity  of  the  differential  diagnosis  at  the 
next  stage  of  the  pattern-directed  ranker  by  further  shrinking 
the  hypothesized  disorder  candidates  space.  In  addition,  it  is 
able  to  compensate  the  model-based  diagnostic  reasoning 
because  of  its  capability  of  eliminating  non-plausible 
candidates.  To  achieve  this  goal,  among  the  disorder 
candidates  hypothesized  in  the  model-based  candidate 
generator,  non-plausible  ones  are  eliminated  by  using  the 
following  constraints  imposed  on  the  candidates: 

(1)  Expected  features  constraint  : To  compare  the  expected 
features  (or  effects)  of  the  hypothesized  disorder 
candidates  and  the  observed  manifestations  by  making  use 
of  the  concept  of  partitioning  the  feature  space  (feature 
verification) ; 

(2)  Disorder  dependence  constraint  : To  consider  the 

dependence  between  multiple  disorders  in  each  candidate, 
when  multiple  disorders  can  occur  simultaneously,  by 
assigning  the  belief  factors  utilizing  fuzzy  set  theory 
which  is  based  on  the  occurrence  frequency  of  the 
simultaneous  multiple  disorders. 

(3)  Cardinality  constraint  : To  use  of  the  maximum  allowable 
cardinality  of  multiple  disorders,  which  is  either  user- 
defined  or  automatically  selected,  to  eliminate  the 
disorder  candidates  having  multiple  simultaneous 
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disorders  more  than  the  specified  maximum  number. 

In  fact,  considering  the  disorder  dependence  as  a 
constraint  in  this  filtering  operation  makes  our  approach  more 
generic  by  isolating  some  domain-dependent,  system-dependent 
information  in  the  diagnostic  knowledge  base.  Furthermore,  we 
assign  the  belief  factor  to  each  set  of  multiple  disorders  by 
evaluating  the  possibility  of  simultaneous  occurrence  on  the 
basis  of  their  relationships,  which  is  also  utilized  to 
recalculate  the  maximum  cardinality  of  the  disorder  candidates 
(or  the  minimum  covering  sets) . 

4.4.1  Expected  Features  Constraint 

The  model-based  candidate  generator  forms  a set  of 
disorder  candidates  based  on  the  observed  manifestations.  Upon 
these  candidates,  inversely,  the  expected  manifestations, 
called  "features",  are  generated  so  that  we  can  examine  if  the 
observed  ones  include  the  significant  features  of  the 
hypothesized  disorder  candidates.  Here,  we  introduce  an 
important  concept  of  partitioning  the  feature  space  based  on 
their  significance  to  each  disorder  so  that  the  conflicts 
between  the  expected  features  and  the  observed  ones  can  be 
resolved  by  eliminating  these  ambiguous  disorder  candidates. 
Partitioning  the  feature  space 

In  the  symbolic  model-based  diagnostic  reasoning,  the 
causal  strength  between  disorder  and  manifestation  is  often 
quantified  only  in  a very  fuzzy  sense  of  IMPOSSIBLE,  UNLIKELY, 
POSSIBLE,  PROBABLE,  and  CERTAIN,  and  distinctive  rules  rather 
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than  a uniform  numerical  computation  are  used  to  combine  data 
different  degrees  of  likelihood  (Szolovits  and  Pauker  [1978]) . 
These  symbolic  probabilities  are  subjective,  non-numeric 
estimates  of  how  frequently  an  event  occurs.  While  exact 
probabilities  of  diagnostic  associations  are  usually  not 
available  (or  symbolic  causal  strengths  are  desired  for 
simpler  reasoning) , a great  deal  of  descriptive  information 
about  diagnosis  exists  in  the  following  form: 

(1)  d(.  certainly  causes  m.  (Type  I causality) 

(2)  dt.  probably  causes  m^  (Type  II  causality) 

(3)  dj  possibly  causes  n^.  (Type  III  causality) 

(4)  dj  unlikely  causes  nr  (Type  IV  causality) 

(5)  dj  never  causes  m^  (Type  V causality) . 

We  find  that  these  symbolic  causal  relations  are  very  useful 
in  further  shrinking  the  disorder  candidates  space 
hypothesized  by  the  model-based  candidate  generator. 

In  the  model-based  reasoning,  the  diagnostic  problem  of 
multiple  disorders  is  to  find  a set  of  disorder  candidates 
which  explains  all  the  observed  manifestations.  In  fact,  this 
problem  can  be  regarded  as  the  multimembership  classification 
problem  where  the  observed  manifestations  are  features  of  the 
disorder  (candidate)  classes.  That  is,  the  diagnostic  problem 
with  multiple  disorders  is  to  find  a set  of  disorder  classes, 
each  of  which  has  some  of  the  features  (or  manifestations)  , 
but  together  they  account  for  all  the  observed  features. 

The  feature  space  under  disorder  d;  consists  of  a set  of 
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possible  manifestations  (features)  caused  by  d( . Given  D,  M, 
and  R in  each  local  causal  model,  the  following  sets  can  be 
defined: 


F(dj)  = features  (d()  = {nr  | <dj,  mj>  e R} 

where  features (df ) , or  F(dj),  is  a set  of  features  for  the 
disorder  d; . 

Now  we  introduce  the  important  concept  of  partitioning 
the  feature  space  in  order  to  further  reduce  the  disorder 
candidate  space.  We  partition  the  feature  space  (or 
manifestation  space),  F(dj)/  into  two  meaningful  subspaces, 
i.e.,  "relevant  features"  and  "irrelevant  features".  The 
relevant  features,  FR(df)/  denote  a set  of  features 
(manifestations)  which  can  be  observed  when  df  is  present.  In 
other  words,  df  can  (i.e.  certainly,  probably,  possibly,  or 
unlikely)  have  the  relevant  features,  which  corresponds  to 
Type  I,  Type  II,  Type  III,  or  Type  IV  causality.  The 
"irrelevant  features",  F^dj),  denote  a set  of  features  which 
cannot  be  observed  when  d.  is  present.  That  is,  d;  can  never 
have  the  irrelevant  features  which  corresponds  to  Type  V 
causality.  First,  to  support  the  concept  of  partitioning  the 
feature  space,  two  relations,  R$  and  Rj , are  defined  as 


follows : 
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Definition  4.7 

If  D = {dv  d2,  dt}  and  M = {m,,  m2,  mk)  are 

finite  sets,  Rr  c d x m represents  that  nij  can  occurs  when  d,. 
are  present,  and  Rj  c D x m represents  that  iHj  never  occurs 
when  d{  are  present.  Thus,  the  relation,  R c D x m,  is 

R = Rr  + Rj , 

where  all  the  binary  relations  belonging  to  Rr  are  l's  and  all 
the  binary  relations  belonging  to  Rj  are  0's. 

Thus,  the  overall  feature  space  of  disorder  df  can  be 
partitioned  as  follows: 

Lemma  4 . 1 

The  feature  (or  manifestation)  space  of  each  disorder  is 
divided  into  two  subsets: 


F(d,)  = FR(d,-)  u F,(d,) 

= {mj  | <dj , mj>  e Rr)  + {mk  | <d, , mk>  e R, } . 

Lemma  4.1  is  self-evident,  thus  does  not  require  a proof. 

The  relevant  features  can  further  be  partitioned  into  two 
smaller  fractions  of  the  feature  space:  (1)  "significantly 

relevant  features  (SR  features)",  namely,  relevant  features 
with  Type  I causality,  and  (2)  "insignificantly  relevant 
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features  (IR  features)",  namely,  relevant  features  with  Type 

II,  Type  III,  or  Type  IV  causality.  The  set  of  SR  features, 
Fsr ( di ) , denotes  a set  of  features  (manifestations)  which  is 
always  observed  when  d;  is  present.  In  other  words,  d.  always 
has  the  relevant  features  with  Type  I causality.  The  set  of  IR 
features,  FIR(dj),  denotes  a set  of  features  which  may  be 
observed  (not  necessarily  observed)  when  ds  is  present.  That 
is,  dj  may  have  the  irrelevant  features  with  Type  II,  Type 

III,  or  Type  IV  causality.  First,  to  support  the  concept  of 
partitioning  the  feature  space,  two  relations,  Rsr  and  R,r,  are 
defined  as  follows: 


Fr  ^sr  + ^1R* 

In  fact,  R$r  and  Rir  are  exclusive  and  exhaustive  subsets  of  Rr. 
Thus,  the  relevant  features  of  disorder  d;  can  be  partitioned 
as  follows: 

Lemma  4 . 2 

The  relevant  features  (or  manifestation)  space  of  each 
disorder  is  partitioned  into  two  subsets: 

fr«U  = FsrW  + F.rW 

= | <d; , mj>  e Rsr)  + (mk  | <dj,  mk>  e R,r). 


Because  it  is  self-evident,  Lemma  4.2  does  not  require  a 
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proof. 

Figure  4.8  illustrates  the  concept  of  partitioning 
feature  space  for  each  disorder  d,. . To  make  use  of  feature 
space  partitioning  concept  for  further  reduction  of  the 
disorder  candidates,  the  following  assumption  is  made: 

Assumption  4.1  (SR  feature  independence) 

The  significantly  relevant  features  (SR  features)  of  one 
disorder  are  present  independently  of  other  disorders.  That 
is,  for  dv  d2,  ...,  dk  e D, 

Fsr(D)  = FSR(d1Ad2A Adk) 

= Fsr  (dl)  uFSR  (d2^  U*  • • • uFSR  • 

This  assumption  is  quite  different  from  and  much  preferable  to 
the  assumption  of  feature  independence  (or  manifestation 
independence)  widely  employed  in  the  previous  approaches.  This 
assumption  is  acceptable  in  most  problem  domains. 

Next,  the  important  theorem  of  the  non-plausible  disorder 
candidates,  which  are  no  longer  eligible  for  further 
reasoning,  is  derived  to  reduce  the  hypothesized  candidate 
space . 

Theorem  4 . 1 

Disorder  candidate,  C?,  is  non-plausible  and  eliminated 
from  a set  of  disorder  candidates,  C+,  if  it  violates  the 
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Figure  4.8  Feature  Space  Partitioning  for  Disorder  df 
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relevant  (Type  I)  causality.  That  is, 


Fsr(C,-)  = Ud.cCrFs«(di>  £ 

Proof  Since  C*  is  a disorder  candidate,  features (Cp 
equal  to  or  include  all  the  manifestations,  QM . Based  on  the 
SR  feature  independence,  FSR(Cp  = FSR  (d1 ) uFSR  (d2)  u.  . . . uF$R  (dj 
where  { d1 , d2,  ...,  dm)  e C*.  Also,  if  the  hypothesized 

disorders  occur,  the  SR  features  of  these  disorders  are 
certainly  present  so  that  QM  includes  those  SR  features  (based 
on  the  definition  of  SR  features)  . Thus,  F$R(C*)  c QM  always 
holds  true.  If  FSR(C*)  « QM , the  hypothesized  disorder 
candidate,  C?,  is  non-plausible. 

Example  4.5 

Consider  the  diagnostic  problem  in  Example  4.4.  By 
introducing  a new  concept  of  partitioning  the  feature  space, 
the  relations  between  disorders  and  manifestations  are 
represented  as  shown  in  Table  4 . 2 and  Table  4.3.  From  the 
diagnostic  result  of  Example  4.4,  we  have 

c+  = (c;,  cp  c;,  c*,  c;) 

= ({da,  d5},  (dg,  d6},  (d6,  d, } , (d6,  d3),  {d6,  d5}). 


By  using  Assumption  4.1, 
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Table  4.2  The  Relations  of  Disorders  and  Manifestations  Based 
on  the  Feature  Space  Partitioning 


disorder 

F(d,) 

= W')  u F,(di) 

di 

FR(d,) 

F,(d,-) 

d1 

{ ' 

m5  } 

{!«,< 

m3, 

ro4, 

d2 

{m3, 

m4} 

{ , 

m2, 

^5  / 

m6> 

d3 

{m2, 

m5} 

(m, , 

nij , 

m4/ 

V 

d4 

{m2} 

{mlf 

m3, 

m4, 

m5 

d5 

{m3, 

®5  / ^6 ) 

{ , 

m2) 

d6 

(a,, 

m6> 

{m2, 

IHj  , 

m4, 

m5} 

d7 

{m2, 

m3, 

m4} 

{m,, 

in3 , 

m6> 

d8 

(in,, 

' 

m4 , m5 } 

{m3, 

m6> 
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Table  4.3  The  Relations  of  Disorders  and  Manifestations  Based 
on  the  Feature  Space  Partitioning  in  the  Relevant 
Features 


disorder 

FR(d,-) 

= FSR(di)  U F I R ^ d i ) 

di 

FSR  (di  ) 

F.R(di) 

di 

{m2} 

{m5} 

d2 

{0} 

(m3, 

ra4) 

d3 

{0} 

(m2, 

m5  > 

d4 

{0} 

{m2} 

d5 

{0} 

{m3, 

m4, 

m5, 

V 

d6 

{ } 

d7 

{0} 

{m2' 

m3, 

d8 

{0} 

in2 , 

m4, 

m5  } 
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Fsr(C;)  = FSR(d8)  u FSR(d5)  = {0} 

FSr(^2^  = U ^SR  ^6^  ~ 

^SR^^3^  ~ ^SR^6^  U - m2^ 

FSr(CD  = FSR(d6)  U FSR(d3)  - <mi> 
fsr(c;)  = FSR(d6)  u FSR(d5)  = {ml}. 

Since  Fsr(C*)  £ QM , C*  should  be  eliminated  from  the  set  of 
disorder  candidates,  C* , by  Theorem  4.1.  The  new  minimum 
cover,  C+  = (C*,  , C*u,  C*)  , is  generated,  which  is  revised  by 

the  concept  of  partitioning  the  feature  space. 

Based  on  this  result,  the  revised  explanation  is 
generated  as  follows: 

If  the  manifestations,  {excess  solder,  pin  holes,  grainy 
solder},  are  observed  from  a PCB  manufacturing  operation,  the 
hypothesized  disorder  candidates  are  {pallet  too  hot  and 
preheat  temperature  low),  {pallet  too  hot  and  conveyor  speed 
low},  {conveyor  speed  low  and  immersion  depth  low},  {conveyor 
speed  low  and  preheat  temperature  low}.  However,  {conveyor 
speed  low  and  solder  temperature  high}  is  excluded  from  the 
disorder  candidates  because  the  SR  feature  of  {solder 
temperature  high},  {insufficient  solder},  is  not  observed. 

4.4.2  Disorder  Dependence  Constraints 

Most  of  the  previous  approaches  proposed  so  far  assume 
that  disorders  are  independent  of  one  another.  This  assumption 
is  not  always  correct  in  some  problem  domains.  However,  this 
assumption  seems  to  be  inevitable  because  the  computational 
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complexity  is  intractable  otherwise,  especially  when  a 
multiple  disorders  problem  is  assumed  and  when  the  number  of 
possible  disorders  is  very  large.  Therefore,  the  heuristic 
strategy  for  compensating  this  generally  unacceptable 
assumption  is  designed  for  more  generic  diagnostic  problem 
solving.  We  propose  to  assign  the  belief  factor  to  each  pair 
of  multiple  simultaneous  disorders  by  evaluating  the 
possibility  of  simultaneous  occurrence  on  the  basis  of  the 
past  history,  which  is  also  utilized  to  recalculate  the 
maximum  cardinality  of  the  disorder  candidates.  Especially  the 
"preventive"  relationship,  which  will  be  explained  in  the  next 
section,  is  utilized  to  eliminate  non-plausible  candidates. 
Two  primary  relationships  are  defined  as  follows: 

Preventive  relationship.  The  preventive  relationship 
describes  the  situation  where  the  presence  of  one  disorder 
prevents  another  disorder  from  being  present  at  the  same  time. 
For  instance,  we  may  have  a disorder  candidates,  {solder 
temperature  high,  solder  temperature  low),  in  the  process 
control  for  a PCB  manufacturing  operation.  Intuitively,  two 
elements  in  this  disorder  candidate  cannot  occur 
simultaneously.  Therefore,  the  hypothesized  candidate  having 
a couple  of  disorders  with  the  preventive  relationship  should 
be  removed  from  the  set  of  disorder  candidates  generated  by 
the  previous  operation,  i.e.,  the  model-based  candidate 
generator. 
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Supportive  relationship.  In  the  process  environment, 
more  than  one  disorder  can  occur  due  to  the  same  cause.  In  a 
special  case,  two  disorders  in  a disorder  candidate  set  almost 
always  occur  simultaneously  which  may  be  considered  as  a 
single  disorder.  Then,  these  two  disorders  have  the 
relationship  called  "supportive". 

Example  4.6 

Based  on  the  result  of  Example  4.5,  we  have  a set  of 
disorder  candidates,  C"  = (C*,  C*,  C*,  C*)  . C*  has  two 
simultaneous  disorders  {d8,  d5},  i.e.,  {pallet  too  hot,  preheat 
temperature  low).  However,  according  to  experts'  knowledge  and 
the  past  history  of  process  control,  these  two  disorders  have 
the  preventive  relationship  due  to  their  extremely  low 
possibility  of  simultaneous  occurrence.  Thus,  we  can  generate 
new  disorder  candidates,  C+  = (C*,  C *,  C*)  , with  the  following 
explanation: 

If  the  manifestations,  {excess  solder,  pin  holes,  grainy 
solder},  are  observed  from  a PCB  manufacturing  operation,  the 
hypothesized  disorder  candidates  are  {pallet  too  hot  and 
preheat  temperature  low},  {pallet  too  hot  and  conveyor  speed 
low},  {conveyor  speed  low  and  immersion  depth  low},  {conveyor 
speed  low  and  preheat  temperature  low}.  However,  {conveyor 
speed  low  and  solder  temperature  high}  is  excluded  from  the 
disorder  candidates  because  the  SR  feature  of  {solder 
temperature  high},  {insufficient  solder},  is  not  observed. 
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{pallet  too  hot  and  preheat  temperature  low}  is  excluded  from 
the  disorder  candidates  because  these  multiple  disorders  are 
unlikely  to  be  present  simultaneously. 

The  consideration  of  disorder  dependence  has  been  ignored 
in  most  previous  works.  Also,  the  fuzzy  set  concept  is 
introduced  to  represent  the  strength  of  the  supportive  (or 
preventive)  relationship.  The  basic  idea  of  a fuzzy  set  used 
here  is  very  simple.  Since  there  is  undecidability  concerning 
the  relationship  (preventive,  supportive) , we  can  consider  a 
function  defined  on  the  strength  of  a relationship  whose 
values  are  in  the  unit  interval  [0,  1]  instead  of  just  0 and 
1.  We  classify  the  strength  of  supportive  (or  preventive) 
relationship  into  five  different  categories  as  follows: 

{ strongly  supportive,  supportive,  do  not  care,  preventive, 
strongly  preventive  } 

Therefore,  we  may  quantize  different  degree  of  the  supportive 
into 

{ 1,  0.75,  0.5,  0.25,  0 }. 

For  example,  the  value  'O'  means  that  two  disorders  considered 
cannot  occur  simultaneously  and  should  be  eliminated  from  a 
set  of  disorder  candidates  before  further  processing.  On  the 
other  hand,  the  value  '1'  means  that  two  disorders  nearly 
almost  occur  simultaneously  and  may  be  handled  as  a single 
disorder  in  the  logical  meaning  of  their  cardinalities. 
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Definition  4.8 

The  relationship  between  two  simultaneous  disorders  can 
be  represented  by  the  following  fuzzy  set  concept: 

{ f ,•  j I f : (d,-,dj)  - [0,1],  1 < i,j  < 1,  i and  2l  = |ffj| 

where  f denotes  the  mapping  function  and  1 is  the  maximum 
number  of  allowable  simultaneous  disorders. 

Then  we  can  define  'Strength  of  the  Supportive  (SOS) ' to 
determine  overall  relationship  over  a set  of  disorders,  in 
general,  more  than  two  simultaneous  disorders. 

SOS  = min  [ f j j | f : (df , d_j ) , i,j  = 1,  2,  ...,  1,  i * j ] . 

Accordingly,  the  'Strength  of  the  Preventive'  can  be  defined 
as  follows: 


SOP  = 50S  = 1 - SOS. 


By  using  the  concept  of  SOS,  we  can  assign  a new  cardinality 
value  to  each  set  of  multiple  disorders  to  revise  the  original 
cardinality  value  which  does  not  consider  the  disorder 
dependence.  In  fact,  the  concept  of  disorder  dependence 
constraint  provides  a systematic  way  not  only  to  reduce  the 
disorder  candidate  space  by  eliminating  all  the  disorder 
candidates  with  the  preventive  relationship,  but  also  to 
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calculate  the  logical  cardinality  being  discussed  in  the  next 
section. 

4.4.3  Cardinality  Constraint 

In  real  world  problems,  there  are  situations  where  the 
number  of  possible  simultaneous  disorders  is  limited  due  to 
various  reasons.  For  instance,  we  may  consider  only  the 
disorder  candidates  with  a single  disorder  because  of  the 
extremely  low  possibility  of  multiple  simultaneous  disorders 
in  some  domain.  Suppose  that  we  have  never  experienced  more 
than  two  simultaneous  disorders  in  a certain  domain.  Then  it 
is  unreasonable  to  handle  disorder  candidates  having  four  or 
five  simultaneous  disorders,  by  just  relying  on  the  theoretic 
model-based  causal  reasoning.  Thus,  we  use  a more  practical 
approach  to  solving  a multiple  disorder  problem  by  imposing 
the  cardinality  constraint  (the  maximum  allowable  multiple 
disorders  in  a disorder  candidate)  on  the  set  of  disorder 
candidates . 

Now  two  different  types  of  the  cardinality  constraints 
are  defined. 

Physical  cardinality.  The  cardinality  of  hypothesized 
disorder  candidate  is  the  number  of  the  multiple  simultaneous 
disorders  that  explain  all  the  observed  manifestations,  QM,  in 
the  best  way  (i.e.,  the  minimum  subset  of  the  covering  set). 
By  considering  the  characteristics  of  a certain  problem 
domain,  the  operator  may  limit  the  allowable  cardinality, 
called  "physical  cardinality".  We  eliminate  all  the  disorder 
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candidates  violating  this  physical  cardinality  constraint  if 
the  operator  chooses  this  option. 

Logical  cardinality.  In  some  problem  domains,  the 
disorder  candidate  consisting  of  two  simultaneous  disorders  is 
more  plausible  to  be  present  than  the  disorder  candidate  of  a 
occasionally  occurring  single  disorder.  Thus,  we  define  a new 
type  of  cardinality  constraint  limiting  the  number  of 
simultaneous  disorders  by  utilizing  the  occurrence  frequency 
of  each  disorder  pair  on  the  basis  of  the  concept  of  SOS 
(Strength  of  the  Supportive)  discussed  in  the  previous 
section. 


Definition  4.9 

If  C denotes  the  cardinality  value  which  has  been 
determined  in  the  model-based  candidate  generator,  the 
cardinality  value  is  revised  by  considering  the  disorder 
dependence,  called  "logical  cardinality": 


d = 


C 

2 x SOS 


where  C > 2 . 


By  introducing  this  new  concept  of  the  cardinality  value, 
we  can  eliminate  candidate  sets  which  have  a larger 
cardinality  than  the  users  has  specified.  Intuitively,  the 
revised  cardinality  value  is  more  reasonable  than  the  original 
one,  because  of  evaluating  the  relationship  between  multiple 
disorders  when  it  is  used  as  the  threshold  value  of  the 
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allowable  multiple  disorders. 

Therefore,  the  constraint-based  filter  eliminates  the 
computational  time  for  reasoning  non-plausible  candidates  by 
resolving  the  ambiguity  between  simultaneous  disorders.  The 
fuzzy  value  of  the  belief  factor  assigned  to  the  supportive 
(or  preventive)  relationship  can  be  utilized  to  decide  the 
maximum  cardinality  of  disorder  candidates  to  being  ranked  in 
the  next  stage  of  pattern-directed  ranker. 

In  the  multi-level  diagnostic  mechanism  proposed  in  this 
dissertation,  the  operators  are  allowed  to  choose  one  of  two 
cardinality  constraints  compensating  the  minimum  covering 
theory  defined  in  Definition  4.4  and  the  disorder  independence 
assumption. 


4.5  Pattern-directed  Ranker 

The  solutions  to  many  real-world  problems  require 
reasoning  under  uncertainty  (also  known  as  inexact, 
approximate,  or  evidential  reasoning) . In  the  multi-level 
diagnostic  reasoning  presented  here,  the  uncertainty  involved 
in  deciding  which  disorder  concept  is  the  most  plausible 
diagnostic  solution  that  explains  the  observed  manifestations 
in  the  best  way. 

The  pattern-directed  ranker  is  designed  to  differentiate 
a set  of  the  disorder  concepts  which  is  hypothesized  and 
focused  (filtered)  by  two  previous  diagnostic  reasoning 
operations,  namely,  the  model-based  candidate  generator  and 
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the  constraint-based  filter.  In  the  pattern-directed  ranker, 
the  PADIKS  approach,  developed  at  the  Center  for  Information 
Research,  is  revised  to  effectively  differentiate  the  most 
plausible  disorder  (disorders)  from  the  hypothesized  disorder 
concepts  and  to  rank  the  disorder  concepts  which  are  a small 
subset  of  the  initial  hypotheses  space.  Since  we  can  focus  our 
attention  on  a small  fraction  of  disorder  candidates,  the 
unreasonable  assumptions  which  have  widely  been  used  to  reduce 
the  computational  complexity  are  eliminated  for  more  general 
diagnostic  problem  solving.  Basically,  the  main  steps  in  the 
pattern-directed  approach  to  the  diagnostic  problem  solving 
are  (1)  to  formulate  the  query  pattern  vector,  xQ,  from  the 
observed  manifestations,  QM , (2)  to  retrieve  related  disorders 
pattern  vectors,  and  (3)  to  recognize  the  associated  disorder 
via  pattern  matching  and  classification  (Tou  [1982])  . In  order 
to  perform  these  steps,  the  knowledge  base  includes  the 
following  information: 

(1)  a list  of  relevant  manifestations  for  each  disorder. 

(2)  possible  values  of  these  manifestations  for  each  disorder. 

(3)  information  about  p(d()  and  p(xQ|d1- ). 

In  fact,  as  the  number  of  disorders  increases,  the  knowledge 
retrieval  and  the  classification  operations  involved  in  the 
pattern-directed  approach  require  a large  amount  of 
computation  time.  Fortunately,  unlike  the  previous  pattern- 
directed  approach  (or  PADIKS  approach) , our  multi-level 
diagnosis  mechanism  generates  a small  fraction  of  disorder 
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candidates  on  which  is  focused  for  the  differential  diagnosis 
in  the  pattern-directed  ranker  proposed  here. 

4.5.1  Bayesian  Approach  to  the  Pattern-Directed  Ranker 

Theoretically,  given  the  pattern-directed  inference,  all 
necessary  relevant  manifestations  observed  (i.e.,  the  query 
pattern  vector,  xQ,  p ( d f ) , and  p(xQ|dt.)  for  each  disorder,  then 
a minimum  probability  of  error  decision  can  be  made  on  the 
basis  of  Bayesian  theorem.  In  the  classical  Bayesian  approach 
to  the  pattern-directed  ranker,  the  posterior  probability  for 
disorder  classification  is  given  by 


P(d,-|xQ) 


P(d,-)  P^d,) 
P(Xq) 


(4.1) 


where  P(dt.)  denotes  the  priori  probability  for  disorder  df , and 
P(xQ)  is  the  probability  of  the  occurrence  of  xQ  that  is  a 
query  pattern  vector  generated  from  the  observed 
manifestations,  QM.  P(xQ|dj)  is  the  conditional  probability  of 
xQ  under  disorder  df.  In  fact,  this  approach  holds  true  only 
if  a single  disorder  is  assumed.  The  set  of  all  possible 
disorders  forms  a mutually  exclusive  and  exhaustive  set  of 
events , i.e., 

m 

d,-  n dj  = o,  p(d,-)  >o,  y.  p(di)  = i* 

i =1 
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However,  in  solving  real-world  problems,  we  may  meet  a 
multiple  disorders  problem  where  multiple  disorders  can  occur 
simultaneously.  In  principle,  a multiple  disorders  problem  can 
be  transformed  into  a single  disorder  problem  by  defining  a 
new  set  of  disorders  which  consists  of  all  the  2lDl  possible 
intersections  of  {dv  d2,  ...,  dm)  . The  combinatorial  explosion 
in  the  number  of  possible  sets  makes  this  approach  impractical 
as  the  value  of  m increases. 

Based  on  Ben-Bassat's  work  (Ben-Bassat  [1980]),  a 
multiple  disorders  problem  can  be  considered  as  the 
multimembership  classification  problem  referred  to  a situation 
in  which  the  object  to  be  classified  may  simultaneously  belong 
to  any  one  of  a given  set  of  disorder  classes,  w = (w,,  w2, 

. . . , wm)  or  to  none  of  them  (binary  choice  approach)  . By 
assuming  that  disorders  are  mutually  independent,  i.e.,  the 
membership  in  class  wf  (in  our  application,  disorder  d()  does 
not  affect  the  probability  of  membership  in  disorder  d;,  i * 
j,  the  posterior  probability  for  membership  in  disorder  d(  is 
given  by 


P(dil*a) 


P(d,-)  P(x0|di) 
P(Xo) 


P(d,)  P(xQ|dj) 

P(d,-)  P(*o|df)  + P(d,)  P(xQ|di) 

P(dj)  P(xQ[d1-) 

P(df)  P(Xq  I d,  ) + [1  - P ( dj ) ] P ( Xq  | dj ) 
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where  P(d;  ) denotes  the  probability  of  absence  of  disorder  d( , 
i.e.,  P(dj)  = 1 - P(df ) , and  P(xQ|dj)  is  the  conditional 
probability  of  xQ  under  the  absence  of  disorder  df . Using  this 
formulation,  the  multiple  disorders  problem  involving  2 1°  I 
classes  can  be  converted  into  a set  of  m problems  in  which 
problem  i is  concerned  with  membership  or  nonmembership  in 
df.  However,  this  approach  is  unable  to  handle  the  case  when 
xQ  is  present,  but  other  disorders  cause  xQ  while  df  is 
present . 

4.5.2  Disorder  Concepts 

In  order  to  directly  apply  the  Bayesian  formula  to  real- 
world  problems  where  multiple  disorders  may  occur 
simultaneously,  we  may  define  a new  set  of  disorder  classes 
consisting  of  all  the  2m  possible  intersection  of  {di;  d2,  ..., 
dm)  . In  practice,  however,  the  exponential  increase  in  the 
number  of  disorder  classes  makes  this  approach  unmanageable  as 
the  value  for  m increases.  Therefore,  we  define  "disorder 
concepts"  , the  set  of  mutually  exclusive  disorder  candidates, 
which  not  only  explain  all  the  observed  manifestations,  but 
also  satisfy  all  the  constraints  in  the  constraint-based 
filter.  As  discussed  in  Section  4.2,  a disorder  candidate  is 
the  set  of  disorders  which  explains  all  the  observed 
manifestations.  In  other  words,  the  disorder  concepts  are  a 
small  subset  of  all  possible  disorder  classes  which  pass 
through  two  previous  reasoning  operations,  i.e.,  model-based 
candidate  generator  and  constraint-based  filter.  Since  we 
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process  only  a small  fraction  of  all  possible  classes  in  this 
pattern-directed  ranker,  the  philosophy  of  disorder  concepts 
makes  it  computationally  tractable  to  rank  the  hypothesized 
candidates . 

We  divide  the  disorder  concepts  into  two  different  types 
of  concepts,  namely,  single  disorder  concept  (simple  concept) 
and  multiple  disorder  concept  (complex  concept) . 


Definition  4.10 

Single  disorder  concept  (or  simple  concept) , c*,  is  the 
event  that  d(  is  present  and  all  the  other  disorder  are 
definitely  not  present,  i.e., 


* j * 

c-  = d- 

i i 


- A dj  A dj  ...  A d^  A dj  A dj+i  ...  A dm 


dj  A ^eD^d*} 


where  D represents  a set  of  all  possible  individual  disorders 
and  the  superscript  * denotes  a disorder  concept. 

Definition  4.11 

Multiple  disorder  concept  (or  complex  concept) , C*  is  the 
event  that  all  dt. 's  which  belong  to  a set  of  disorders,  C]f  are 
present  and  all  the  other  disorder  are  definitely  not  present, 
i.e., 
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, A j \ 

( n / 

di€C, 


A ( 


A 


d^D-C, 


dk) 


where  | C* | > 2 . 

Since  c*  in  Definition  4.10  is  considered  as  a special  case  of 
C*,  we  can  use  Definition  4.11  as  the  general  definition  of 
both  simple  concept  and  complex  concept. 

In  fact,  the  mutually  exclusive  and  exhaustive  assumption 
made  in  the  classical  Bayesian  approach  implies  that  disorders 
are  mutually  dependent.  Disjoint  events  are  always  dependent 
since  the  occurrence  of  one  excludes  the  occurrence  of  the 
others.  Thus  the  disorder  concepts  are  mutually  exclusive  and 
statistically  dependent. 

Example  4 . 6 

Suppose  that  there  are  three  possible  disorders,  i.e., 
d1 , d2,  d3.  Based  on  the  definition  of  disorder  concepts,  the 
following  disorder  concepts  possibly  exist: 


dia2a3 
d1  d2  d3 

= d1a2d3 


cl^  d2  dj 

= d1d2a3 
= cT^  d2  d3 


C1,2,3  “ did2d3. 
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4.5.3  Strength  of  Causal  Association 

In  the  categorical  causal  model,  each  local  causal  model 
consists  of  D,  all  possible  disorders,  M,  all  possible 
manifestations,  and  R,  causal  relations  between  D and  M.  The 
causal  relation,  rf . , is  represented  by  binary  features, 
namely,  1 (true)  or  0 (false)  to  examine  the  existence  of 
causal  association  between  df  and  nr  for  the  hypothesized 
candidates  generation  in  the  model-based  candidate  generator. 
In  the  constraint-based  filter,  this  relation  is  represented 
by  three  levels  of  features,  i.e.,  significant  relevant 
feature,  insignificant  feature,  and  irrelevant  feature  for 
excluding  non-plausible  candidates.  However,  in  order  to 
provide  numeric  reasoning  of  uncertainty  in  the  pattern- 
directed  ranker,  the  strength  of  causal  association  (c;.) 
between  disorder  (d;)  and  manifestation  (m.)  should  be  more 
precisely  defined  to  perform  differential  diagnosis.  In  fact, 
the  strength  of  causal  association,  c,.,  can  be  referred  as  to 
the  conditional  probability  of  nu  under  d;  in  most  previous 
approaches . 

In  the  classical  Bayesian  theory,  the  strength  of  causal 
association  can  be  simply  defined  by 

c\y  = P(mj|df)  , (4.3) 

i.e.,  the  conditional  probability  of  manifestation  itu  when 
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disorder  d;  is  present.  Because  this  definition  explicitly 
assumes  that  the  disorders  form  a mutually  exclusive  and 
exhaustive  set  of  events,  it  holds  true  only  if  single 
disorder  problem  is  considered. 

Here  we  present  an  important  concept  of  the  causation 
event  which  is  developed  by  Peng  and  Reggia's  work  (Peng  and 
Reggia  [ 1987a ] ) . 

Causation  event.  For  any  ds  e D and  m^  e M,  d^nr 
denotes  the  event  that  df  actually  causes  m- . This  causation 
event  is  true  if  and  only  if  both  d;  and  nij  are  present  and  n^. 
is  actually  being  caused  by  dj , since  the  event  means  the 
presence  of  df  is  causing  the  presence  of  m^ . 

The  causation  event  is  not  equivalent  to  nij  and  df  both 
being  present,  since  the  latter  situation  could  happen  while 
some  other  disorder  is  causing  m.  in  a multiple  disorder 
problem.  Based  on  this  innovative  concept  of  a causation 
event,  the  strength  of  causal  association  is  defined  as 
follows : 


cf  j =P(dj-mj|d1) 

P (d^mj  A d,- ) 
= PTd^ 


(4.4) 


This  new  definition  of  c(. . reflects  the  average  frequency  with 
which  a given  d.  causes  m^ . c5j-  is  a better  measurement  of  the 
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strength  of  causal  association  than  the  conditional 
probability  PCnr  IdJ  used  in  the  Bayesian  approach.  For 
instance,  if  dj  cannot  cause  irij , P (d^nij  | d{ ) is  equal  to  0 
because  of  P(d1-->mj.)  = 0.  However,  clearly  P (m^  | df ) need  not  be 
0 when  multiple  simultaneous  disorders  may  occur  (in  a 
multiple  disorders  problem) , since  another  disorder  could 
cause  m.  while  dt-  is  present.  It  is  quite  easy  to  show  that 

P (dj-mj  | dj ) < P(mj|d1)  . 

A diagnostician  would  be  more  familiar  with  the  strength, 
cj?5,  with  which  a disorder  (d()  causes  a particular 
manifestation  (m^ ) , than  the  strength,  c^]5,  the  purely 
statistical  quantity  needed  in  the  standard  application  of 
Bayesian  formula.  However,  in  obtaining  the  probability  values 
statistically  from  the  previous  history  and  record,  it  is 
straightforward  to  obtain  P(mJ  |d). ) , while  Pfd^mj  | d( ) appears  to 
be  a rather  elusive  quantity  because  of  its  complication  due 
to  the  large  number  of  possible  multiple  disorders  candidates. 
Therefore,  we  define  a new  concept  of  the  strength  of  causal 
association,  which  will  be  utilized  in  the  pattern-directed 
ranker. 

Let  us  define  X,  the  context  of  dj-ntj,  to  be  a conjunction 
of  any  cause  and  causation  events  or  their  negations  other 
than  d^m.  and  its  negation.  Now  the  assumption  in  the  strength 
of  causal  association  is  formally  stated  as  follows: 
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Assumption  4 . 2 

The  strength  of  causal  association,  c^. , between  disorder 
dj  and  manifestation  nij  is  independent  of  any  of  its  contexts 
X given  that  d . has  occurred.  That  is, 

P(di-*mj  | d,AX)  = P(drmj  | d1 ) - (4.5) 


In  order  words,  c^.  is  a fixed  value  (ds  always  causes  nij  with 
a fixed  value  of  ci .) . This  assumption  is  quite  different  from 
the  manifestation  independence  assumption  used  in  most 
previous  approaches. 


Theorem  4 . 2 

For  any  dj  e D and  nij  e M, 


'i  j 


= P(mj  |d’) 

= P(d,-mj|di)  . 


(4.6) 


where  d*  stands  for  the  event  that  d{  is  present  and  all  the 
other  disorders  are  not  present. 


Proof  Let 


X=  A ^ 

dreD-df 


Then 
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d*  = X A d,- . 

P(*iK)  = P<dXDd|t"mi|d') 

= PCdj-itij  I d*) 

= P (dj-^rrij  | xAd,  ) 

= PCdj-itij  | df)  . 

The  last  equality  is  based  on  Assumption  4.2  because  X is  a 
context  of  d^m.  (Peng  and  Reggia  [1987a]). 

4.5.4  Pattern-Directed  Ranking 

Based  on  the  discussion  in  the  previous  sections,  now  we 
present  a new  differential  reasoning  procedure,  in  fact,  a 
modified  version  of  the  PADIKS  approach,  for  ranking  the 
hypothesized  disorder  concepts.  The  relative  likelihood  of  a 
disorder  concept  given  QM  can  be  determined  by  the  following 
equation: 


P(C,*|Qm) 


P(Qh|C,*)  p ( Cj  ) 
P(Qm)  • 


(4.7) 


Since  the  denominator  of  Equation  4.7,  P(QH),  is  a constant 
for  any  given  QM , the  likelihood  function,  L(C*,  QM)  , is 
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l(c;,  qm)  = p(q„|c;) p(c;) . 


(4.8) 


By  assuming  the  disorder  independence,  P(C*)  is  computed 
as  follows: 


P(C,*)  = P(( 


A 

d,-€c; 


d,)  A ( 


A 

dkeD-c: 


ak)) 


n 

di  eC 


.P(di) 


n 

eD-C 


.d-P(dk)  ) . 


(4.9) 


Lemma  4 . 3 

If  the  hypothesized  disorder  candidate  C*  c D and  Mj  e M, 

then 

P(mj  | C*)  = (1-  n (i-Cjj)).  (4.10) 

J dieCi 

Proof  Let  C*  = {du  d2,  ...,  dr)  = C*.  By  assuming  the 
causality  independence  and  disorder  independence, 

P(mj  | C’)  = P(C-mj  | Cr)  . 

where  Cr  = d.,Ad2A  . ...,Adp  = Cj . The  proof  is  by  induction  on  r. 
Base  case:  If  r = 1,  P ( d^mj  | &, ) = c^.  = 1 - (1  - c^.) 

Induction  step:  Assume  the  lemma  holds  for  any  r > 1.  Now  show 
that  it  holds  for  r + 1. 
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P ( Cr+i_*'iiij  | Cr+1)  - P(Cr->mJ-  | Cr)  + P(dr+1-+mJ-  | dr+i) 

- P(Cr-*mj  | Cr)  . P(dr+1^mj  | dr+1) 

r r 

= (1-T"]  + cr+1,j  “ ( ^ - T1  (^--cij))  * cr+1,j 

1=1  1=1 

r+1 

= 1 - T]  (l-cu). 

1 = 1 


From  (4.10) , 


P(ffij|Cj) 


n 

d,eC 


.(1  * c,j) 


For  the  conditional  probability  P(QM|C*)  where  QM  is  given,  we 
thus  have  the  following  result: 


P(S,ICD  - P((4o.mi> 


A ( 


^ . mi) 

% eM-QH  1 


c,*) 


. n 

mj€<2M 


P(Dlj|CI*) 


n 

mleM-QM 


P(ffii|C,*) 


. n 

mj€Q„ 


(1- 


n 

di€c; 


(1-C|j)) 


n n 

mteM-Qn  d,-eC* 


( 1 ci j ) • 


By  considering  the  effect  of  disorder  dependence  in  a multiple 
disorders  problem,  the  concept  of  SOS  (strength  of  the 
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supportive)  developed  in  Section  4.4  is  utilized  to  derive 
more  plausible  concept  of  the  likelihood  measure  as  follows: 


l/(C;,  Q„)  = SOS-  L(C;,Qm) 

= SOS-  P(Q„|c;)  P ( Cj*)  . 


(4.11) 


Based  on  the  definition  of  a likelihood  function,  the 
pattern-directed  ranker  performs  the  following  procedures  for 
ranking  the  hypothesized  disorder  concepts: 

(1)  Retrieve  a set  of  disorder  concepts,  C*,  which  is 
hypothesized  in  the  model-based  candidate  generator  and 
focused  in  the  constraint-based  filter. 

(2)  Retrieve  a set  of  observed  manifestation,  QM . 

(3)  Retrieve  all  the  probabilities,  i.e.,  prior 

probabilities,  P(dj)'s,  and  all  the  strengths  of  causal 
associations,  Cjj 's,  where  d. 1 s belong  to  all  the 
disorders  consisting  C*'s. 

(4)  Compute  P(C*)  , P(QM|C*)  based  on  P(df)'s  and  c^.'s. 

(5)  Compute  L(C*,Qm)  and  L'(C*,QM). 

(6)  List  the  disorder  concepts  with  their  likelihood 
measurements  and  explanations. 

4.6  Summary  and  Discussions 

In  this  chapter,  we  have  presented  a novel  diagnostic 
reasoning  approach,  called  "multi-level  diagnostic  mechanism", 
which  consists  of  three  distinctive  diagnostic  operations, 
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that  is,  the  model-based  candidate  generator,  the  constraint- 
based  filter,  and  the  pattern-directed  ranker.  This  layer 
architecture  also  has  many  advantages  over  most  of  the 
previous  mixed-type  ones  in  expanding,  modifying,  and  updating 
its  reasoning  scheme  and  knowledge  base.  In  this  work,  we 
focus  our  attention  on  the  strategy  of  reducing  the  effect  of 
approximate  reasoning  (or  uncertainty  reasoning)  by,  first, 
relying  on  the  structural  symbolic  reasoning  and  the  ambiguity 
resolution . 

In  fact,  we  notice  that  the  differential  diagnosis  in  the 
pattern-directed  approach  involves  not  only  uncertainty  in 
extracting  values  for  various  probabilities  (especially  as  the 
number  of  possible  disorders  increases) , but  also  a great 
degree  of  computational  complexity  in  ranking  the  hypothesized 
candidates.  This  is  why  we  propose  a new  approach  to 
diagnostic  reasoning  which,  first,  reduces  the  candidates 
space  based  on  the  categorical  casual  model  and  some 
constraints  instead  of  directly  applying  the  PADIKS  approach 
to  whole  possible  candidate  space.  Here  we  summarize  the 
overall  diagnostic  reasoning  procedure,  especially  emphasizing 
how  to  effectively  reduce  the  hypotheses  space  (initially  very 
large  when  a multiple  disorders  problem  is  considered)  without 
unreasonable  assumptions,  before  finalizing  the  solutions  (or 
the  ranked  plausible  candidates) . 

In  solving  a general  diagnostic  problem,  let  us  consider 
the  set  of  all  possible  disorders,  represented  by  D = 
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{d1 , d2,  . . . , dt } , and  the  set  of  all  possible  manifestations, 
represented  by  M = {m1  ,m2, . . . , mk} . When  a multiple  disorders 
problem  is  considered,  a new  set  of  disorder  classes,  CQ, 
consisting  of  2 lD  I (i.e.,  2l)  possible  intersection  of 
(d^dj,  . . . ,dt}  is  defined,  i.e.,  CD  = { C1 , C2,  . . . , Cn}  where  n = 
2l.  For  instance,  when  1 = 2,  we  get  {d1f  d2,  d1Ad2,  a).  The 
initial  hypotheses  space  consisting  of  all  possible  disorder 
classes,  CD,  is  usually  very  large  so  that  it  is 
computationally  intractable  to  directly  apply  Bayesian  formula 
to  CD.  The  model-based  candidate  generator  employing  the 
categorical  causal  model  and  the  set  covering  theory  forms  a 
small  fraction  of  CD,  namely,  a set  of  disorder  candidates,  C+, 
which  explains  all  the  observed  manifestations.  Thus  we  focus 
on  this  new  set  of  disorder  candidates  for  the  further 
reasoning  process.  Through  filtering  operation  in  the 
constraint-based  filter,  the  hypotheses  space  is  further 
reduced.  We  call  members  of  this  shr inked  hypotheses  space 
"disorder  concepts",  C*,  which  are  disorder  classes  fulfilling 
both  the  model-based  structural  reasoning  and  the  constraint- 
based  ambiguity  resolution.  Upon  the  set  of  disorder  concepts, 
the  pattern-directed  ranker  performs  a differential  diagnosis 
to  solve  the  uncertainty  in  finding  the  most  plausible 
disorder  concepts  and  to  rank  a set  of  disorder  concepts.  The 
final  diagnostic  solution  consists  of  a set  of  the  ranked 
disorder  concepts  as  in  the  following  form: 
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(c\  l)  = {(c;,  l,) , (c;,  i^) , . ..,  (c*,  l,),  . ..,  (c;,  ls)>. 

where  C*  represents  the  ith  disorder  concept,  Lt-  represents  the 
likelihood  measure  of  C*  causing  all  the  observed 
manifestations, and  s represents  the  number  of  the  most 
plausible  disorder  concepts  being  recommended  to  the 
operators . 

The  multi-level  inference  discussed  here,  we  believe, 
provides  an  efficient  diagnostic  reasoning  for  various 
applications  including  multiple  disorders  problems  and  focuses 
on  the  reduced  hypotheses  space  to  recommend  operators  the 
most  plausible  disorder  with  control  directives.  Figure  4.9 
depicts  how  the  proposed  approach  finalizes  a set  of  the  most 
plausible  disorder  candidates  from  the  initial  hypotheses 
space. 


4.7  Conclusions 

We  have  introduced  a new  approach  to  diagnostic  problem 
solving  by  integrating  probabilistic  pattern-directed 
inference  with  a symbolic  model  of  diagnostic  reasoning  based 
on  a content-associative  causal  network.  The  categorical 
causal  model  is  combined  with  a formal  model  of  diagnostic 
inference  referred  to  as  PADIKS  (Pattern-Directed  Knowledge- 
Based  System) . Combined  with  set  covering  theory,  the 
categorical  causal  model  is  utilized  to  eliminate  some 
difficulties  to  the  direct  application  of  the  PADIKS  approach 
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H : Initial  hypotheses  space. 

CD  : Possible  disorder  candidates  space,  2 1° I possible 

intersection  of  {d1  ,d2,  . . . ,dt } . 

C+  : Disorder  candidates  space  explaining  QH. 

C*  : Disorder  concepts  space  excluding  all  Ct's  violating 

the  constraints  in  the  constraint-based’  filter. 

S : Ranked  disorder  concepts  space  excluding  all  C* ' s which 

are  below  the  threshold  value  in  the  likelihood  measure. 


Figure  4.9  Hypotheses  Space  Reduction  in  the  Proposed  Multi- 
Level  Diagnostic  Reasoning 
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to  general  diagnostic  problems,  such  as  the  incredibly  large 
number  of  probabilities  required  as  part  of  the  knowledge 
base,  and  the  combinatorial  explosion  of  diagnostic  hypotheses 
when  multiple  disorders  are  possible.  Conversely,  the  PADIKS 
approach  strengthens  the  categorical  causal  model  by  searching 
the  most  plausible  hypotheses  and  ranking  hypotheses. 
Furthermore,  the  concept  of  a constraint-based  filter  is 
introduced  by  utilizing  three  important  constraints  not  only 
to  further  reduce  the  computational  complexity,  but  also  to 
resolve  the  ambiguity  involved  in  multiple  disorders  problems. 
The  expected  feature  constraint,  which  is  based  on  the  concept 
of  partitioning  the  feature  space,  is  to  justify  the  disorder 
candidates  hypothesized  in  the  model-based  candidate 
generator.  Also,  the  disorder  dependence  constraint  eliminates 
disorder  candidates  which  have  'preventive'  relationships  and 
gives  a credit  to  ones  which  have  'supportive'  relationships. 

This  multi-level  diagnostic  reasoning  mechanism,  which 
consists  of  the  model-based  candidate  generator,  constraint- 
based  filter,  and  pattern-directed  ranker,  provides  a highly 
efficient  diagnostic  procedure  and  generates  more  realistic 
expert  behavior. 


CHAPTER  V 

DSS/DPC  : KNOWLEDGE-BASED  DECISION  SUPPORT  SYSTEM 
FOR  FAULT  DIAGNOSIS  AND  PROCESS  CONTROL 

5.1  System  Organization 

In  the  previous  chapters,  a new  scheme  of  the  knowledge- 
based  system  for  diagnosis  in  a distributed  process 
environment  was  designed  to  perform  two  primary  tasks:  (1) 

data  integration  to  generate  well-defined  data  hierarchy  from 
the  occasionally  erroneous  data  observed  in  a distributed 
process  environment,  and  (2)  multi-level  diagnostic  inference 
for  the  automatic  process  control  on  the  basis  of  the  observed 
manifestations  and  experts'  knowledge.  These  two  tasks  are  the 
main  technical  concerns  in  this  dissertation. 

Based  on  various  diagnostic  strategies  discussed  in 
Chapter  III  and  IV,  the  DSS/DPC,  Knowledge-based  Decision 
Support  System  for  Fault  Diagnosis  and  Process  Control,  is 
developed  to  support  the  quality  assurance  in  an  automated 
manufacturing  environment  by  identifying  all  the  defects  (or 
manifestations)  observed  from  the  shopfloor,  by  diagnosing  the 
defect  causes  on  the  basis  of  all  the  observed  defects,  and  by 
providing  troubleshooting  procedures.  We  chose  a PCB 
manufacturing  operation  (in  fact,  a PCB  assembly  operation)  as 
our  problem  domain,  because  its  subprocesses  are  highly 
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distributed  and  automated  and  because  we  can  easily  collect 
the  domain  knowledge  and  test  data  from  industries.  As 
illustrated  in  Figure  5.1,  the  DSS/DPC  performs  the  fault 
diagnosis  and  supervisory  control  on  the  basis  of  the  test 
data  from  the  test  workcells  via  the  data  collection  system. 
In  developing  the  prototype  DSS/DPC,  we  focus  on  the 
utilization  of  test  data  to  find  the  most  plausible  defect 
causes  and  to  provide  repair  procedures  and  control 
procedures . 

Figure  5.2  depicts  the , primary  software  modules  in  the 
prototype  DSS/DPC.  As  shown  in  Figure  5.2,  the  DSS/DPC 
consists  of  three  subsystems;  (1)  data  integration  subsystem 
(DIS) , (2)  decision  support  subsystem  (DSS) , and  (3)  operator 
interface  subsystem  (OIS) . As  discussed  in  Chapter  III,  the 
data  integration  subsystem  mainly  performs  four  different  data 
operations,  i.e.,  data  normalization,  data  verification,  data 
validation,  and  data  association,  on  the  test  data  in  the  raw 
database  to  form  an  error-free  categorical  database.  The 
decision  support  subsystem  provides  various  tools  such  as 
statistical  analysis,  diagnostic  reasoning,  repair  procedure, 
and  process  control  procedure,  to  help  the  operators  make 
correct  decisions  on  the  process  control.  The  operator 
interface  subsystem  is  designed  to  provide  the  operators  with 
a user-friendly,  interactive  communication  channel  to  the 
DSS/DPC. 


149 


: Material  Flow 


: Information  Flow 


Figure  5.1  DSS/DPC:  Knowledge-Based  Decision  Support  System 
for  Fault  Diagnosis  and  Control  in  a Manufacturing 
Operation 
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Figure  5.2  Three  Primary  Subsystems  in  the  DSS/DPC 
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5.2  System  Development  and  Integration 

In  the  DSS/DPC,  test  data  are  considered  only  the 
possible  observations  (or  manifestations)  collected  from 
various  test  workcells  in  a PCB  manufacturing  operation.  Thus 
we  process  a set  of  test  data  which  consists  of  several  data 
attributes  needed  for  specifying  the  observed  defect.  These 
test  data  are  provided  by  a PCB  manufacturing  company,  which 
are  gathered  by  the  data  collection  system  depicted  in  Figure 
1.3. 

Figure  5.3  (a)  shows  the  test  data  in  the  raw  database. 
These  data  are  rearranged  for  illustration  as  shown  in  Figure 
5.3  (b)  , which  consist  of  OBS  (observation  number),  PDATE 
(processing  date) , CH  (multiplexer  channel  number) , BADGE 
(operator  badge  number) , WO  (workorder) , PCB_ID  (PCB 
identification  number) , OP_NO  (operation  number) , DEFECT_ID 
(defect  identification  code) , NO  (number  of  defects) , LOCATION 
(defect  location) , and  DAYTIM  (date  and  time) . 

5.2.1  Data  Integration  Subsystem 

In  a PCB  manufacturing  operation,  technicians  at  various 
test  workcells  enter  test  data  into  the  raw  database  via  the 
data  collection  system.  After  carefully  examining  these  test 
data  in  the  raw  database,  we  find  many  erroneous  data  due  to 
wrong  data  entry,  wrong  unit,  wrong  code,  and  so  on.  The  data 
integration  subsystem  performs  four  different  data  operations 
to  form  the  error-free  test  data  and  to  generate  the  well- 


OBS  PDATE  CH  BADGE  WO  PCB  ID  OP  NO  DEFECT  ID  NO  LOCATION  DAYTIM 
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Figure  5.3  PCB  Test  Data  in  the  Raw  Database  (a)  Part  of  the  Raw  Database 
(b)  Part  of  the  Raw  Database  Rearranged  for  Illustration 
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structured  category  file  on  the  basis  of  the  entity-category- 
relation  structure  discussed  in  Section  2.3.2. 

The  data  integration  operation  can  simply  be  represented 
as  follows: 

FOR  i=l  TO  N ; for  each  test  data 
NORMAL  ( Dj ) - { Di  j- } ; 

FOR  j=l  TO  M 

VERIFI  ( Djj ) - [ D j j , ER(k)] 

VALIDA(Djj , ER  (k)  ) - 5fj 
ASSOCI  ( 5^ ) - [5fj,  CC] 

END  j ; 

END  i; 

Since  we  discussed  the  details  in  Chapter  III,  the  data 
integration  operations  are  briefly  introduced  in  terms  of  the 
program  structure.  NORMAL  (Dt- ) is  the  data  normalization 
operator  on  the  test  data  D; , which  may  be  partitioned  into 
several  test  results  to  make  all  the  attributes  atomic. 
VERIFI (Djj)  is  the  data  verification  operator  on  the  normalized 
test  data  D-. , generating  the  error  flag  ER (k)  if  erroneous 
attributes  are  found,  k denotes  the  error  code.  The  data 
validation  operator,  VALIDA (Dfj. , ER(k)  ) , is  to  correct  or 
delete  the  erroneous  test  data  via  the  interactive  operator 
interface  subsystem.  The  error-free  test  data,  Bfj. , are 
categorized  by  the  data  association  operator,  ASSOCI ( f - ) , and 
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located  in  the  content-associative  hierarchy  by  the  category 
code,  CC.  These  operators  are  based  on  the  entity/attribute 
dictionary,  the  value  dictionary,  and  the  categorization 
procedures  presented  in  Chapter  III. 

5.2.2  Decision  Support  Subsystem 

In  the  DSS/DPC,  the  decision  support  subsystem  mainly 
aims  to  provide  the  operators  with  various  information  to 
enhance  the  product  quality  in  a manufacturing  operation.  The 
primary  information  for  quality  assurance  provided  by  this 
subsystem  is:  (1)  defect  identification,  (2)  defect  removal 
method,  (3)  defect  cause  identification,  and  (4)  defect  cause 
removal  method.  The  defect  identification  is  to  list  all  the 
observed  defects  (manifestations)  with  their  statistical 
properties.  The  defect  removal  method  is  the  touch-up  strategy 
to  remove  the  observed  defect(s).  It  is  different  from  the 
defect  cause  removal  method  which  is  to  remove  the  cause (s)  of 
the  observed  defect(s)  instead  of  the  defect  itself,  based  on 
the  diagnostic  result.  The  defect  cause  identification  is  to 
hypothesize  all  the  disorder  candidates  (or  defect  causes) 
which  explain  all  the  observed  defects. 

To  effectively  provide  these  information,  three  primary 
software  modules  are  designed,  i.e.,  the  statistical  analysis 
module,  the  multi-level  diagnosis  module,  and  the  repair  and 
control  procedures  module.  The  data  structure  for  the  decision 
support  subsystem  consists  of  CNIT  (categorical  node  index 
table) , CMD  (causal  model  dictionary) , CMDT  (causal 
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model/disorder  table)  , DD  (disorder  dictionary)  , DDT  (disorder 
dependence  table) , DMRT  (disorder/manifestation  relation 
table) , MD  (manifestation  dictionary) , CDGT  (control  directive 
generator  table) , and  RPT  (repair  procedure  table) , as  shown 
in  Figure  5.4.  The  CNIT  plays  a significant  role  in  this 
subsystem.  It  represents  a knowledge  sketch  base  in  an 
associative  tree  structure.  The  MDs ' are  designed  to  represent 
all  the  possible  defects  in  a PCB  manufacturing  operation,  as 
shown  in  Figure  5.5.  In  fact,  among  those  defects  listed  in 
Figure  5.5,  13  solder  defects  and  24  defect  causes  are  coded 
in  the  data  structures  shown  * in  Figure  5.4. 

5.2.3  Operator  Interface  Subsystem 

The  operator  interface  subsystem  is  designed  to  provide 
the  interactive  communication  channel  between  DSS/DPC  and  the 
operators.  This  subsystem  consists  of  several  functional 
modules.  Among  those  modules,  we  present  three  primary 
software  modules,  namely,  (1)  menu-driven  command  interpreter 
module,  (2)  explanation  facility  module,  and  (3)  visualization 
module. 

Menu-driven  command  interpreter  module.  This  software 
module  provides  the  mapping  function  between  the  operator- 
chosen  menu  and  the  corresponding  software  modules  as  well  as 
the  display  of  the  multi-level  interactive  menu. 

Explanation  facility  module.  This  module  aims  to 
provide  the  operators  with  English-like  messages  to  explain 
the  results  of  the  DSS/DPC  operation  and  to  provide  the 
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010.. ..  Adhesion,  Tape/Oecal/Label 

020.. ..  A1 igrment /Adjustment /Binding 

030.. ..  Coating,  Conformal 

031.. . Wrong  Location 

032. . .Missing  Coating 

033. . .flak Ing/Pee ling 

034. . .excessive  Bubbling 

035.. . excessive  Coating 

040.. ..  Contamination 

041. . .foreign  Material 

042..  .flux 

043.. . Masking  Material 

050.. ..  Insulation,  Damaged/Ml sslnq 

051 .. .excessive  Insulation  Gao 

052. . .1.sufficient  Insulation  Gao 

053. . .Missing  Insulation 

054. . .Meta I Case  Component  Mounted  Over  Circuit 

Path  (not  Insulated) 

055.. . Glass-cased  Components  not  Protected  by 

Buffer  Material 

056.. . Component  not  Besting  on  feet.  Pads  or 

Projections 

057. . .Charing,  Burning  or  Oamage  to  Insulation 

(affecting  electrical  operation) 

058.. . Charing,  Burning  or  Oamage  to  Insulation 

(not  affecting  electrical  operation) 

060.. ..  Installation 

061.. . Wrong  Location 

062. . .Missing  Component 

063.. . Wrong  Polarity 

064.. . Part  hot  Seated 

065.. . 1.properly  Mounted 

066.. . 5.ort  Leads 

067.. . Long  Leads 

068.. . Leads  Improperly  Bent  (Formed) 

069.. . Component  Improperly  Spaced 
06A... Possible  Component  Abrasion 

068.. . Component  Mot  Centered 
06C... Improper  Lead  Wrap 

060. .  .Comoonent  has  Improper  Clearance  above  PV8 
06E... Component  not  Sufficiently  Supported 

06f ...Comoonent  not  Mounted  Perpendicular 
06G... Improper  Vertical  Mounted  Component  Clearance 
06M... Component  Obscures  Termination  of  Another 
Component 

06 1.. . Comoonent  has  Improper  Clearance 
06J...Long  Leads  (electrical  clearance  sufficient) 
06*... Improper  Lead  Bend  Radius 

061.. . 1.proper  Lead  Bend  Clearance 
06M... Improper  Stress  Relief 
06M...Lead  Cut  after  final  Solder 
06P... Insufficient  Lead  Clinch  (non  PTH) 

060.. . Excessive  Wlckfng 

068. . .Lead  Clinch  Beyond  Allowable  Limits 

(electrical  clearance  Insuff Iclent ) 

065. . .Excessive  Terminal  fill 

06T... Component  Less  Than  1/16  Inch  to  Edge  of 
PV8  Causing  Interference 

060.. . Component  Less  Than  1/16  Inch  to  Edge  of  PV8 

not  Causing  Interference 
06V... Improper  Terminal  Swagglng 

070.. ..  Narking 

071. . .Wrong  Location 

072. . .Missing  Marking 

073. . .111.gible  Marking 

080..  ••  Paint 

081..  .Wrong  Location 

082.. . Missing  Paint 

083. . .Damaged  Paint 

084.. . Wrong  Shade/Color 

090.. ..  Pins 

091. . .Bent  Pins 

092.. .M1ss1ng  Pin 

093.. . Damaged  Pin/Crimp 

094.. . Pin  Not  Seated 

100.. ..  Voids,  Weld/Solder 

110..  ••  Weld,  Improper/Cracked/Missing 

120.. ..  Wire,  Wrap/8ond1ng 

130.. ..  Wiring 

131. . .Wrong  Location 

132. . .Missing  Wire 

133.. . Damaged  Wire/Broken 

134.. . Reversed  Wire 

135.. .81.dcag1ng 

136..  .No  wire  In  PTH  (hand  solder) 

137..  .Cut  or  Nicked  Leads  or  wires 

138..  .Leads  or  wires  Scraped  Exposing  Basis  Metal 

or  Stretched 

139.. . Leads  or  Wires  Scraped  Not  Exposing  Basis 

Metal  or  Stretched 
13A... Stranded  Wire  Lay  Olsturbed 


1<0«...  Wrong  Component 

141. . .Type 

142..  .value 

143.. . Materia  I 

150.. ..  Defective  Vendor  Material 

160.. ..  Damaged 

161. . .Chipped  Component 

162.. . Cracked  Component 

163. . .Broken  Component 

164.. . Meas ling  or  Crazing  After  Solder 

165. . .Damaged  PV8 

166. . .Delamination  of  PV8  Base  Material 

167. . .Measles  (Reference  QA8  76) 

168..  .Pit,  Scratches  or  Inclusions 

169..  .Blisters  In  Conductor  Pattern 
16A... Wrinkles  In  Conductor  Pattern 
16B...Iacroper  Hole  Size 

16C... Burned,  Scorched  PV8  or  Parts 

160.. . Dewetting  on  PW8  Conductor  Paths 

(areas  which  are  not  part  of  a solder  Joint) 
16E... Improper  Solder  Coating 
16F... Silver  on  Circuit  Path 
16G... Meas ling  or  Crazing  (bare  PV8) 

170.. ..  Solder 

171. . .501.er  Bridges 

172. . .Missing  Solder 

173.. . fractured  Solder  Joint 

174. . .Excess  Solder,  Lead  Not  Olscemable 

175.. . 1.sufficient  Solder 

176.. .P1nholes/P1ts/Vo1ds 

177. . .501.er  Bubbles 

178. . .Cold  Solder  Joint 

179.. . Poor  Wetting 

17A...0ewett1ng  of  Solder  Connection  Areas 

178. . .Rosin  Joint 

17C... Solder  Not  Smooth  and  Shiny 

1 70.. . 0.erheated  Solder 

17E... Inclusion  In  Solder  Connection 
17F. ..Visible  Basis  Metal  In  Solder  Connection 
17G...Lack  of  Solder  Coverage  on  Lead  Ends 
17H... Solder  Splatter 

171. . .501.er  In  Send  Radius 
17J... Unfilled  PTH  (wave  soldered) 

17*. ..Excess  Solder,  Lead  Olscemable 

171.. . .501.er  Points,  Peaks  or  Icicles 

180.. ..  Documentation 

181.. . Configuration 

182.. . Tags 

1 83. .  .Process/Ins tr/Owg/Proc/EN 

190.. ..  Eoulpment/Process 

195.. .. Machine  Set-up/Evaluation 

200.. ..  Out  of  Tolerance 

201.. . 5.orted 

202.. . 00.n 

203. .  . 1.terml ttent 

204.. . Voltage 

205.. . frequency 

206.. . No  Output 

210.. ..  Olstorted,  Warped/Bent 

211.. . Excessive  Warp  or  Twist  (bare  PV8) 

212.. . Excessive  Warp  or  Twist  (after  soldering) 

220.. ..  Error,  Operator/Inspector 

230.. ..  film  Oata 

240.. ..  finish.  Micro 

250.. ..  Hardware 

251.. . Wrong  Har<h*ere 

252.. . Missing  Hardware 

253.. . Damaged  Hardware 

254.. . Loose  Hartoare 

260.. ..  Invalid  Reject 

270.. ..  Lot  Code,  Mlsslng/11 legible 

2 80.. ..  Packaging/Packing 

290.. ..  Pad,  Lifted  (after  soldering) 

291.. . 5.paration  of  Conductor  Pattern  (bare  PVB) 

292.. .. Circuitry,  Damaged 

293.. . Ribbon/FI ex  Cable 

294.. . Pattern  Oclamlnated  (after  soldering) 

300.. ..  Plated  Thru  Hole,  Mlsslng/Oamaged 

310.. ..  Plating,  Anodizing,  Irldlte,  Oamaged,  Missing 

320.. ..  Porosity 

330. . ..  Potting 

340.. ..  Shelf  Life,  Omitted/Expired 

350.. ..  Solderablllty 

360.. ..  Staklng/f larlng 

370.. ..  Terminal  or  Eyelet,  Not  Seated/Cracked/ 

Loose 


All  the  Possible  PCB  Defects  with  Their 
Classification  Codes 


Figure  5.4 
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CNIT  (Categorical  Node  Index  Table) 


Cateqorv 

Pointer 

Pointer  to 

# of 

1st  Subnode 

2nd  Subnode 

Code 

to  CMD 

Parent  Node 

Subnodes 

Address 

Address 

CMD  (Causal  Model  Dictionary) 


CM 

Pointer 

Pointer 

# of 

Pointer 

Name 

to  CMDT 

to  CDGT 

Disorders 

to  CNIT 

CMDT  (Causal  Model/Disorder  Table) 


# of 

Pointer  to 

Disorder 

Pointer  to 

Disorder 

Disorders 

1st  DD 

Prob. 

2nd  DD 

Prob. 

DD  (Disorder  Dictionary) 


Disorder 

Pointer 

Pointer 

# of 

Pointer  to 

Pointer  to 

Name 

to  DMRT 

to  DDT 

Assoc.  CM 

1st  CM 

2nd  CM 

DDT  (Disorder  Dependence  Table) 


# of 

Pointer  to 

Strength  of 

Pointer  to 

Strength  of 

Disorders 

1st  DD 

Supportive 

2nd  DD 

Supportive 

DMRT  (Disorder/Manifestation  Relation  Table) 


# of 

Pointer  to 

Causal 

Pointer  to 

Causal  | 

Manifest. 

1st  MD 

Strength 

2nd  MD 

Strength 

MD  (Manifestation  Dictionary) 


Manifest. 

Pointer 

# of  Assoc. 

Pointer  to 

Pointer  to 

Name 

to  RPT 

Disorder 

1 st  Dsrdr 

2nd  Dsrdr 

CDGT  (Control  Directive  Generator  Table) 


# of 

Pointer  to 

Control 

Pointer  to 

Control  1 

Disorders 

1st  DD 

Procedure 

2nd  DD 

Procedure! 

RPT  (Repair  Procedure  Table) 


# of 

Pointer  to 

Repair 

Pointer  to 

Repair 

Manifest. 

1st  MD 

Procedure 

2nd  MD 

Procedure 

. . . 

Figure  5.5  Data  Structures  of  the  Decision  Support  Subsystem 
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recommendation  for  repair  procedures  and  control  procedures. 
Especially,  we  develop  three  explanation  facilities  for:  (1) 
the  repair  procedures  for  the  observed  manifestations,  (2)  the 
diagnostic  reasoning  procedures  and  their  results,  and  (3)  the 
control  procedures  for  the  hypothesized  disorder  candidates. 

Example  5.1 

Suppose  we  observe  the  solder  defect,  {dewetting},  which 
is  often  found  in  a PCB  manufacturing  operation.  The 
explanation  facility  for  repair  procedures  generates  the 
following  explanation  to  help  the  operators  perform  the 
corrective  action: 

Strip  the  solder  off  the  surface  and  clean  underneath 

before  resoldering  by  using  the  following  steps. 

(1)  Chemical  stripping  with  a solution  that  does  not 
attack  the  copper  or  board  materials. 

(2)  Leveling  the  board  and  resoldering  it  with  an 
aggressive  flux. 

(3)  For  component  leads  or  terminations,  multiple 
dipping  or  passes  through  the  wave  are  effective. 

Visualization  module.  The  visualization  module  is  to 
provide  various  tools  to  graphically  represent  the  retrieved 
information,  which  aids  the  operators'  comprehension  and 
conceptualization.  This  software  module  allows  the  operators 
to  choose  the  appropriate  graphical  representation  schemes  via 
the  interactive  interface.  In  the  prototype  DSS/DPC,  various 
graphical  outputs  such  as  vertical  bar  chart,  horizontal  bar 
chart,  block  graphs,  pie  graphs,  and  star  graphs,  are  designed 
to  represent  the  statistical  information  such  as  the 
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manifestation  distribution  and  the  disorder  distribution. 

5.3  Development  Environment 

To  facilitate  system  development,  the  powerful  software 
package,  SAS,  was  employed  as  the  system  kernel.  SAS  seems  to 
be  an  adequate  software  tool  for  the  development  of  the 
DSS/DPC,  because  it  provides  various  utilities.  Those 
utilities  include  (1)  information  storage  and  retrieval,  (2) 
data  modification  and  programming,  (3)  report  generation,  (4) 
statistical  analysis,  (5)  file  management,  and  (6)  graphical 
information  representation.  In  fact,  to  use  the  DSS/DPC  for 
real-time  applications  (or  quicker  solutions) , we  should 
employ  a fast  multi-tasking  operating  system  as  the  system 
kernel  instead  of  SAS.  In  addition  to  its  limited  processing 
speed,  SAS  has  some  drawbacks  such  as  the  lack  of  debugging 
capability.  However,  we  choose  SAS  as  the  system  kernel  of  the 
prototype  DSS/DPC  because  it  provides  not  only  diverse 
utilities  and  library  functions,  but  also  the  development 
environment  for  the  prototype  system  which  is  enough  to 
demonstrate  the  feasibility  of  the  proposed  approach.  The  use 
of  the  SAS  software  tremendously  shortens  the  development 
cycle  of  the  prototype  DSS/DPC  due  to  its  built-in  utilities. 

5.4  Experimental  Results 

The  prototype  DSS/DPC  has  been  implemented  in  80386-based 
IBM-PC  with  the  SAS  environment.  As  shown  in  Figure  5.4,  the 
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139  different  types  of  PCB  defects  are  defined  and  categorized 
in  the  data  integration  subsystem.  The  data  integration 
subsystem  has  successfully  generated  the  defect  category  file 
from  an  erroneous  raw  database  which  was  provided  by  a PCB 
manufacturing  company.  Among  several  defect  subcategories,  the 
diagnostic  mechanism  has  been  applied  to  only  one  subcategory, 
i.e.  a set  of  the  wave  solder  defects.  The  causal  associations 
between  13  solder  defects  (manifestations)  and  24  defect 
causes  (disorders)  shown  in  Figure  5.6  are  defined  in  the 
causal  model.  By  focusing  on  a single  subcategory  and 
utilizing  the  causal  associations  stored,  the  system  has 
generated  a small  subset  of  disorder  candidates  which  explain 
all  the  observed  solder  defects.  By  applying  three  constraints 
to  those  candidates,  the  system  further  limits  the  number  of 
disorder  candidates  considered  and  eliminates  non-plausible 
disorder  candidates.  Figure  5.7  shows  some  experimental 
results  which  include  some  intermediate  results  to  show  how 
the  system  limits  the  number  of  disorder  (defect  cause) 
candidates  considered.  Among  224  disorder  candidates  in  the 
initial  hypotheses  space,  the  system  hypothesized  24  defect 
causes  which  explained  the  observed  defects  in  the  example  of 
Figure  5.7  (b)  . By  applying  three  constraints  defined  in  the 
constraint-based  filter,  the  system  further  reduced  the  number 
of  defect  causes  considered.  Finally,  the  system  generated  the 
ranked  disorder  candidates  to  help  the  users  make  a decision 
for  the  process  control. 


n o o 
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C Definition  of  all  the  possible  disorders  (defect  causes) 

C involved  in  the  wave  solder  process  in  PCB  manufacturing 
C operation 

D ( 1) = ' solder  temperature  high ' 

D(2)=' solder  temperature  low' 

D(3)=' immersion  depth  high' 

D (4)=' immersion  depth  low' 

D( 5)=' solder  wave  uneven' 

D(6)=' solder  contaminated' 

D (7) =' excessive  solder  dross' 

D(8) =' preheat  temperature  high' 

D (9)=' preheat  temperature  low' 

D( 10)= 'flux  contaminated' 

D(ll)='flux  specific  gravity  low' 

D(12)='flux  specific  gravity  high' 

D(13)='flux  no  longer  active' 

D( 14)= 'flux  not  making  contact' 

D( 15)= 'flux  formhead  low' 

D ( 16) = ' f luxer  uneven' 

D(17)='flux  blow-off  excessive' 

D(18)='no  flux  blow-off' 

D( 19)= 'pallet  too  hot' 

D (20) =' conveyor  speed  high' 

D (21) =' conveyor  speed  low' 

D (22)= 'conveyor  vibration' 

D (23)= 'early  removal  of  board' 

D(24)= 'board  not  seated  right' 


Definition  of  all  the  possible  manifestations  (defects) 
involved  in  the  wave  solder  process  in  PCB  manufacturing 
operation 

M(l) =' excessive  solder' 

M(2)=' insufficient  solder' 

M(3)='no  solder' 

M(4)=' solder  bridge' 

M ( 5) = ' icicles ' 

M(6)='poor  wetting  leads' 

M(7)='poor  wetting  lands' 

M(8)='pin  holes  or  voids' 

M ( 9 ) = ' contaminated ' 

M ( 10 ) = ' disturbed ' 

M ( 11) =' solder  on  laminate  or  contact' 

M( 12)= 'grainy  solder' 

M(13)='flux  entrapment' 


Figure  5.6  Wave  Solder  Defects  and  Defect  Causes  Encoded  in 
the  Diagnostic  System 
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THE  OBSERVED  SOLDER  DEFECTS  ARE:  3 

EXCESSIVE  SOLDER 
SOLDER  BRIDGE 
FLUX  ENTRAPMENT 


THE  MAXIMUM  CARDINALITY  IS:  2 


THE  HYPOTHESIZED  DEFECT  CAUSES  ARE:  6 


CONVEYOR  SPEED  HIGH 
PREHEAT  TEMPERATURE 
PREHEAT  TEMPERATURE 
PREHEAT  TEMPERATURE 
PREHEAT  TEMPERATURE 
PREHEAT  TEMPERATURE 


LOW 

.AND 

LOW 

.AND 

LOW 

.AND 

LOW 

.AND 

LOW 

.AND 

SOLDER  TEMPERATURE  LOW 
SOLDER  WAVE  UNEVEN 
BOARD  NOT  SEATED  RIGHT 
FLUXER  UNEVEN 
PALLET  TOO  HOT 


THE  EXCLUDED  DEFECT  CAUSES  ARE:  1 

PREHEAT  TEMPERATURE  LOW  .AND.  PALLET  TOO  HOT 


THE  RANKED  DEFECT  CAUSES  ARE:  5 

(1)  CONVEYOR  SPEED  HIGH 

(2)  PREHEAT  TEMPERATURE  LOW  .AND.  SOLDER  TEMPERATURE  LOW 

(3)  PREHEAT  TEMPERATURE  LOW  .AND.  SOLDER  WAVE  UNEVEN 

(4)  PREHEAT  TEMPERATURE  LOW  .AND.  FLUXER  UNEVEN 

(5)  PREHEAT  TEMPERATURE  LOW  .AND.  BOARD  NOT  SEATED  RIGHT 


(a) 


Figure  5.7  Experimental  Results  (a)  QM  = {excessive  solder, 
solder  bridge,  flux  entrapment)  (b)  QM  = {pin 
holes  or  voids,  contaminated,  disturbed) 
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THE  OBSERVED  SOLDER  DEFECTS  ARE:  3 

PIN  HOLES  OR  VOIDS 

CONTAMINATED 

DISTURBED 


THE  MAXIMUM  CARDINALITY  IS:  2 


THE  HYPOTHESIZED  DEFECT  CAUSES  ARE:  24 


CONVEYOR  SPEED  HIGH 


SOLDER 
SOLDER 
SOLDER 
SOLDER 
SOLDER 
SOLDER 
SOLDER 
CONVEYOR 
CONVEYOR 
CONVEYOR 
CONVEYOR 
CONVEYOR 
CONVEYOR 
CONVEYOR 


TEMPERATURE  HIGH 
TEMPERATURE  HIGH 
TEMPERATURE  HIGH 
TEMPERATURE  HIGH 
TEMPERATURE  HIGH 
TEMPERATURE  HIGH 
TEMPERATURE  HIGH 
SPEED  LOW  .AND. 
SPEED  LOW  .AND. 
SPEED  LOW  .AND. 
SPEED  LOW  .AND. 
SPEED  LOW  .AND. 
SPEED  LOW  .AND. 
SPEED  LOW 


.AND.  BOARD 
PREHEAT  TEMPERATURE  HIGH  .AND. 
.AND.  PREHEAT  TEMPERATURE  LOW 
PREHEAT  TEMPERATURE  HIGH  .AND. 


.AND.  SOLDER  TEMPERATURE  LOW 
.AND.  IMMERSION  DEPTH  LOW 
.AND.  SOLDER  WAVE  UNEVEN 
.AND.  EXCESSIVE  SOLDER  DROSS 
.AND.  CONVEYOR  VIBRATION 
.AND.  EARLY  REMOVAL  OF  BOARD 
.AND.  BOARD  NOT  SEATED  RIGHT 
SOLDER  TEMPERATURE  LOW 
IMMERSION  DEPTH  LOW 
SOLDER  WAVE  UNEVEN 
EXCESSIVE  SOLDER  DROSS 
CONVEYOR  VIBRATION 
EARLY  REMOVAL  OF  BOARD 
NOT  SEATED  RIGHT 
EXCESSIVE  SOLDER  DROSS 


CONVEYOR  VIBRATION 


.AND.  PREHEAT  TEMPERATURE  LOW 

PREHEAT  TEMPERATURE  HIGH  .AND.  EARLY  REMOVAL  OF  BOARD 
.AND.  PREHEAT  TEMPERATURE  LOW 

PREHEAT  TEMPERATURE  HIGH  .AND.  EXCESSIVE  SOLDER  DROSS 
.AND.  PALLET  TOO  HOT 

PREHEAT  TEMPERATURE  HIGH  .AND.  CONVEYOR  VIBRATION 
.AND.  PALLET  TOO  HOT 

PREHEAT  TEMPERATURE  HIGH  .AND.  EARLY  REMOVAL  OF  BOARD 
.AND.  PALLET  TOO  HOT 

PREHEAT  TEMPERATURE  HIGH  .AND.  EXCESSIVE  SOLDER  DROSS 
.AND.  FLUXER  UNEVEN 

PREHEAT  TEMPERATURE  HIGH  .AND.  CONVEYOR  VIBRATION 
.AND.  FLUXER  UNEVEN 

PREHEAT  TEMPERATURE  HIGH  .AND.  EARLY  REMOVAL  OF  BOARD 
.AND.  FLUXER  UNEVEN 


Figure  5.7 


Continued 
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THE  EXCLUDED  DEFECT  CAUSES  ARE:  15 


SOLDER  TEMPERATURE  HIGH  .AND.  SOLDER  TEMPERATURE  LOW 
SOLDER  TEMPERATURE  HIGH  .AND.  IMMERSION  DEPTH  LOW 
SOLDER  TEMPERATURE  HIGH  .AND.  SOLDER  WAVE  UNEVEN 
CONVEYOR  SPEED  LOW  .AND.  IMMERSION  DEPTH  LOW 
CONVEYOR  SPEED  LOW  .AND.  SOLDER  WAVE  UNEVEN 
CONVEYOR  SPEED  LOW  .AND.  SOLDER  TEMPERATURE  LOW 
PREHEAT  TEMPERATURE  HIGH  .AND.  EXCESSIVE  SOLDER  DROSS 
•AND.  PREHEAT  TEMPERATURE  LOW 

PREHEAT  TEMPERATURE  HIGH  .AND.  CONVEYOR  VIBRATION 
.AND.  PREHEAT  TEMPERATURE  LOW 

PREHEAT  TEMPERATURE  HIGH  .AND.  EARLY  REMOVAL  OF  BOARD 
•AND.  PREHEAT  TEMPERATURE  LOW 

PREHEAT  TEMPERATURE  HIGH  .AND.  EXCESSIVE  SOLDER  DROSS 
.AND.  PALLET  TOO  HOT 

PREHEAT  TEMPERATURE  HIGH  .AND.  CONVEYOR  VIBRATION 
.AND.  PALLET  TOO  HOT 

PREHEAT  TEMPERATURE  HIGH  .AND.  EARLY  REMOVAL  OF  BOARD 
•AND.  PALLET  TOO  HOT 

PREHEAT  TEMPERATURE  HIGH  .AND.  EXCESSIVE  SOLDER  DROSS 
•AND.  FLUXER  UNEVEN 

PREHEAT  TEMPERATURE  HIGH  .AND.  CONVEYOR  VIBRATION 
.AND.  FLUXER  UNEVEN 

PREHEAT  TEMPERATURE  HIGH  .AND.  EARLY  REMOVAL  OF  BOARD 
•AND.  FLUXER  UNEVEN 


THE  RANKED  DEFECT  CAUSES  ARE:  9 

(1)  CONVEYOR  SPEED  HIGH 

(2)  SOLDER  TEMPERATURE  HIGH  .AND.  CONVEYOR  VIBRATION 

(3)  CONVEYOR  SPEED  LOW  .AND.  CONVEYOR  VIBRATION 

(4)  CONVEYOR  SPEED  LOW  .AND.  EARLY  REMOVAL  OF  BOARD 

(5)  CONVEYOR  SPEED  LOW  .AND.  BOARD  NOT  SEATED  RIGHT 

(6)  SOLDER  TEMPERATURE  HIGH  .AND.  BOARD  NOT  SEATED  RIGHT 

(7)  SOLDER  TEMPERATURE  HIGH  .AND.  EXCESSIVE  SOLDER  DROSS 

(8)  SOLDER  TEMPERATURE  HIGH  .AND.  EARLY  REMOVAL  OF  BOARD 

(9)  CONVEYOR  SPEED  LOW  .AND.  EXCESSIVE  SOLDER  DROSS 


(b) 


Figure  5.7  Continued 


CHAPTER  VI 

SUMMARY  AND  CONCLUSIONS 
6 . 1 Summary 

The  design  of  knowledge-based  expert  systems  for  fault 
diagnosis  and  supervisory  control  has  received  a good  deal  of 
attention  among  knowledge  engineers,  resulting  in  improved 
efficiency  and  effectiveness.  The  fault  diagnosis  and  control 
in  a distributed  process  environment  is  a knowledge-intensive 
and  experience-based  task,  which  in  complex  processes  can 
sometimes  go  beyond  the  capabilities  of  skilled  operators  and 
engineers . 

In  this  dissertation  we  presented  a new  approach  to  fault 
diagnosis  and  control  in  distributed  process  environments  by 
integrating  probabilistic  pattern-directed  inference  with  a 
symbolic  model  of  diagnostic  reasoning  based  on  the 
categorical  causal  model.  Specifically,  the  categorical  causal 
model  is  combined  with  a formal  model  of  diagnostic  inference 
referred  to  as  PADIKS  (Pattern-Directed  Knowledge-Based 
System)  developed  at  the  Center  for  Information  Research.  The 
data  integration  technique  is  designed  to  put  into  a knowledge 
base  the  spreading  of  the  observable  information  over  the 
shopfloor,  to  interrelate  element  based  on  the  entity- 
category-relation  structure,  and  to  generate  error-free 
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hierarchical  category  files.  The  data  integration  techniques 
facilitate  the  knowledge  retrieval  and  the  diagnostic 
reasoning  as  a data  preprocessing  operation. 

The  proposed  diagnostic  mechanism,  which  consists  of 
three  different  levels  of  reasoning  operations  (i.e.,  the 
model-based  filter,  the  constraint-based  filter,  and  the 
pattern-directed  ranker) , significantly  reduces  the 
computational  complexity  in  the  diagnostic  problem  with 
uncertainty  by  systematically  shrinking  the  hypotheses  space. 
This  approach  is  applied  to  the  test  and  inspection  data 
collected  from  a PCB  manufacturing  operation. 

6.2  Significance  of  This  Research 

The  main  accomplishments  of  this  research  are  summarized 
as  follows: 

(1)  The  data  integration  technique  generates  the  error-free, 
well-structured  category  files  which  facilitate  knowledge 
retrieval  and  diagnostic  reasoning.  The  proposed 
technique  consists  of  four  distinctive  operations,  i.e., 
data  normalization,  data  verification,  data  validation, 
and  data  association.  Based  on  the  entity-category- 
relation  structure,  the  data  integration  not  only 
eliminates  the  possibility  of  wrong  diagnosis  due  to  the 
erroneous  data,  but  also  generates  the  hierarchical 
category  file. 

(2)  As  an  enhanced  version  of  the  PADIKS  approach,  the  multi- 
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level  diagnostic  mechanism  is  designed  to  provide  a 
highly  efficient  diagnostic  procedure  by  systematically 
shrinking  the  hypotheses  space.  This  layered  architecture 
also  maintains  a great  flexibility  in  modifying  the 
diagnostic  reasoning  scheme. 

(3)  Combined  with  the  set  covering  theory,  the  concept  of 
categorical  causal  model  is  proposed  to  reduce  the 
computational  complexity  involved  in  a general  diagnostic 
problem,  especially  in  a multiple  disorders  problem.  This 
categorical  causal  model  is  constructed  on  the  basis  of 
the  subprocess  configuration  in  a distributed  process 
environment. 

(4)  Three  important  constraints  (i.e.,  the  expected  feature 
constraint,  the  disorder  dependence  constraint,  and  the 
cardinality  constraint)  are  deliberately  designed  to 
eliminate  the  non-plausible  disorder  candidates 
hypothesized  in  the  model-based  causal  reasoning.  The 
concept  of  partitioning  the  feature  space  is  introduced 
in  the  expected  feature  constraint  to  justify  the 
hypothesized  candidates.  The  disorder  dependence 
constraint  also  compensates  the  disorder  independence 
assumption  which  is  widely  used  in  a complex  problem 
domain. 

(5)  In  the  pattern-directed  ranker,  the  PADIKS  approach  is 
revised  to  effectively  handle  a multiple  disorders 
problem.  To  prevent  the  combinatorial  explosion  of  the 
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multiple  disorders  problem,  the  pattern-directed  ranker 
focuses  on  a small  subset  of  the  initial  hypotheses 
space. 


6.3  Areas  for  Future  Work 

In  this  dissertation,  we  have  presented  some  principles 
and  techniques  in  designing  knowledge-based  diagnostic 
systems.  Although  the  proposed  approach  has  shown  encouraging 
results,  it  is  by  no  means  complete.  More  research  work  is 
needed  to  enhance  the  design  concepts  proposed  for  the 
diagnostic  reasoning  mechanism.  For  a more  general  diagnostic 
problem  solving,  we  recommend  that  further  research  be  carried 
out  on  the  following  directions: 

(1)  Although  the  concept  of  the  categorical  causal  model 
presented  in  Chapter  IV  is  very  powerful  in  dealing  with 
problems  with  classif icatory  properties,  theoretically 
justifiable  criteria  of  categorizing  the  local  causal 
models  are  required  to  obtain  the  best  performance  in  the 
diagnostic  problem  solving.  The  minimization  of  the 
number  of  the  overlapped  manifestations  between  local 
causal  models  discussed  in  Chapter  III  can  be  one  of  the 
possible  criteria. 

(2)  In  a general  diagnostic  problem,  we  may  have  intermediate 
states  which  are  not  directly  observable,  i.e.,  not 
manifestations,  but  are  also  not  the  hypothesized 
candidates,  i.e.,  not  disorders.  To  handle  situations 
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where  causal  chaining  exists,  these  intermediate  states 
should  be  included  in  the  causal  model.  Thus,  to  expand 
the  generality  and  robustness  of  the  proposed  multi-level 
diagnostic  mechanism,  the  strategy  of  including  these 
intermediate  states  is  needed  for  the  categorical  causal 
model . 

(3)  In  generating  the  strength  of  causal  association  for  the 
pattern-directed  ranker,  currently  we  rely  on  the  past 
history  of  process  control.  That  is,  the  disorder 
concepts  are  formed  by  training  samples  in  the  past 
history.  However,  to  handle  the  situations  where  the 
hypothesized  multiple  disorders  are  unseen  in  the  past, 
the  concept  formation  technique  should  be  developed  to 
generate  a new  disorder  concept  by  utilizing  the  existing 
disorder  concepts. 

After  several  years  of  research  in  the  field  of 
intelligent  information  systems,  we  have  come  to  believe  that 
there  exist  solutions  to  the  problems  mentioned  above.  We  hope 
that  more  researchers  will  take  up  these  challenging  problems 
in  an  effort  to  develop  a more  general  diagnostic  system  for 
a distributed  process  environment,  and  we  hope  that  this 
dissertation  can  contribute  to  the  realization  of  those 
objectives  in  a short  time. 


APPENDIX 

ARCHITECTURAL  FRAMEWORK 
FOR  INTEGRATING  DISTRIBUTED  SUBPROCESSES 

As  discussed  in  Chapter  I,  the  distributed  process 
consists  of  various  subprocesses  which  cooperate  to  perform  a 
specific  task.  Therefore,  one  of  the  main  technical  problems 
in  a distributed  process  environment  is  how  to  integrate 
distributed,  heterogeneous  information  systems  (i.e., 
subprocesses)  so  that  real  integration  and  full 
interchangeability  of  isolated  subprocesses  can  be  achieved. 
To  achieve  this  goal,  our  approach  is  to  conceptualize 
subprocesses  as  distributed  information  systems  by  decomposing 
each  subprocess  into  functional  components  and  integrating 
them  at  the  semantic  level  (Tou,  Chung,  Park,  and  Jeong 
[1988] ) . 

Here  we  present  a new  reference  model  as  an  architectural 
framework  for  each  subprocess  in  a distributed  information 
process  environment.  In  designing  the  standard  reference 
model,  three  principal  features  of  the  distributed  process  are 
considered,  which  are  (1)  distributed  (network),  (2)  data- 
intensive  (data),  and  (3)  task-specific  (control).  These 
distinctive  modules,  i.e.,  network,  data,  and  control, 
characterize  each  subprocess  as  a subcomponent  of  an  overall 
distributed  process  system.  In  order  to  facilitate  the 


170 


171 


information  sharing  between  distributed  subprocesses,  a new 
reference  model  is  designed  on  the  basis  of  the  well-known 
ISO's  OSI/RM  (International  Standard  Organization's  Open 
System  Interconnection  / Reference  Model)  and  the  concept  of 
three  primary  modules. 

A. 1 Functional  Decomposition  and  Integration  of  Subprocess 

By  considering  primary  modules  in  each  subprocess,  the 
functional  decomposition  and  the  integration  of  each  separate 
module  are  considered  to  develop  the  new  concept  of  reference 
model  for  a more  general,  flexible  standard  architecture  of 
distributed  subprocesses. 

Functional  Decomposition 

In  data-intensive  information  systems  it  is  usual  to  make 
a separation  between  programs  and  data,  namely,  program-data 
independence.  That  is,  changes  in  data  structure  should  not 
require  a change  in  program  structure  and  vice  versa.  This 
separation  introduces  the  well-known  three-schema  architecture 
for  database:  external,  internal,  and  conceptual  schema.  In 
distributed  process  environment,  the  advent  of  the  distributed 
database  introduces  a complication  that  there  may  be  as  many 
internal  schemas  as  there  are  subprocesses  that  store  parts  of 
the  database.  This  introduced  the  problem  of  data 
communication  into  the  internal  level  of  database  management 
in  each  subprocess. 

Distributed  subprocesses  such  as  workcells  (or 
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workstations)  in  the  automated  manufacturing  floor  are  not 
typically  data-intensive,  but  have  distributed  extremely 
characteristics.  That  is,  programs  are  mainly  concerned  with 
communication  between  subprocesses.  The  requirement  of 
flexibility  and  replaceability  of  computerized  components  has 
led  to  efforts  such  as  MAP  (Manufacturing  Automation  Protocol) 
which  aims  at  program-communication  independence  (Schutz 
[1987]).  The  increasing  complexity  of  such  applications 
results  in  growing  amounts  of  stored  data,  and  so  introduces 
the  data  management  problem  into  programs. 

It  is  remarkable  that  one  subprocess,  aiming  at  program- 
data  independence,  introduces  the  communication  problem  into 
its  internal  database  management,  while  the  other  subprocess, 
aiming  at  program-communication  independence,  introduce  the 
data  management  problem  into  their  external  network  management 
(Pels  and  Wegter  [1987])  . This  observation  leads  us  to  suggest 
that  in  both  areas  a strict  separation  should  be  introduced 
between  the  following  three  modules:  data,  network,  and 
control.  Also,  we  generalize  the  distinction  between 
conceptual,  external,  and  internal  levels,  as  in  the  ANSI 
three-schema  architecture,  for  three  modules  (Tsichritzis  and 
Lochosky  [1982]).  This  concept  presents  a system  architecture 
which  characterizes  each  subprocess  by  9 separate  schemas 
spreading  over  three  modules  and  three  levels  as  shown  in 
Figure  A-l. 
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Figure  A.l  Three  Modules/Three  Levels  Schemas  Architecture 


174 


Conceptual  Level  Integration 

Because  the  conceptual  level  is  that  level  where 
standards  for  the  semantical  aspect  of  integration  can  and 
should  be  developed,  the  conceptual  level  of  integration  is 
presented.  When  the  notion  of  property  of  data  is  introduced, 
the  conceptual  database  schema  can  be  partitioned  into 
distinct  modules  that  correspond  to  logical  functions  in  the 
system.  To  be  able  to  model  the  communication  between  modules, 
three  different  scopes  are  distinguished  for  data  elements 
that  are  known  to  a specific  module:  private,  public,  and 
foreign. 

Different  modules  can  be  integrated  into  larger  modules 
by  binding  the  foreign  data  elements  of  one  module  to  public 
data  elements  of  the  other  modules.  The  specification  of  the 
bonds  between  foreign  and  public  data  introduces  the  network 
aspect  into  the  conceptual  data  schema.  This  may  be  regarded 
as  a new  concept  since  known  literatures  about  network  and 
distributed  database  only  consider  the  physical  distribution 
of  program  and  data  over  physical  processors  and  storage 
devices  at  different  locations.  The  division  of  the  conceptual 
database  schema  into  database  modules  makes  no  assumptions 
concerning  the  physical  location  of  the  data  elements  of  the 
modules . 

Figure  A. 2 depicts  how  the  three  conceptual  schemas  are 
interfaced  with  transaction,  data  transfer,  and  trigger.  The 
conceptual  data  schema  can  be  specified  in  two  distinct  parts: 
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Figure  A. 2 


Functional  Decomposition  and  Integration  of 
Distributed  Subprocesses 
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the  specification  of  data  elements  and  the  specification  of 
transactions.  Data  elements  are  specified  by  domain,  name,  and 
value.  The  value  can  be  simple  like  Boolean,  or  integer,  or 
complex  like  a table.  The  transactions  are  the  only  means  by 
which  changes  of  value  can  be  invoked. 

The  function  of  the  conceptual  control  schema  is  to 
specify  atomic  units  of  operations,  called  event-procedure.  A 
module  can  be  reactivated  by  a trigger  from  another  module.  An 
event-procedure  will  be  composed  of  transaction  statements, 
trigger  statement  and  conventional  structured  control 
statement  (IF  THEN  ELSE,  WHILE,  CASE  etc.).  A conceptual 
control  schema  consists  of  the  specifications  of  event 
procedures  and  triggers.  The  conceptual  network  schema  of  a 
module  specifies  how  one  or  more  other  modules  are  integrated 
into  a larger  conceptual  module.  The  smaller  modules  are 
designated  as  the  nodes  of  the  larger  module.  A data  transfer 
provides  the  geometrical  linkage  of  data  elements  in  different 
modules. 

A. 2 New  Reference  Model  for  Distributed  Subprocess 

In  order  to  identify  and  separate  architectural 
components  for  a distributed  subprocess,  we  present  a new 
reference  model  based  on  our  conceptual  decomposition  and 
integration  techniques  discussed  in  the  previous  section. 

ISO's  OSI/RM.  The  ISO's  OSI/RM  has  widely  been  used  as 
a reference  architecture  of  distributed  information  systems. 
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This  standard  reference  model,  however,  tends  to  force  all 
control  and  data  management  aspects  out  of  the  internal 
network  schema,  by  consequently  lifting  them  to  the 
application  layer.  Therefore  OSI/RM  seems  to  emphasize  only 
the  properties  of  network  module.  However,  the  reference  model 
for  a distributed  information  subprocess  should  consider  other 
aspects  for  a modular  and  flexible  architecture. 

Database  system  architecture.  One  the  first  major 
architecturally  oriented  efforts  to  examine  the  nature  of 
computer-based  information  systems  was  the  effort  of  the 
ANSI/ SPARC/ Study  Group  on  Database  Management  to  identify  the 
architecture  of  database  management  systems.  But  this 
reference  architecture  seems  to  be  inadequate  to  describe  the 
distributed  database  management  system  since  the  current  model 
makes  no  distinction  between  database  management  and 
communication.  Also  this  model  does  not  give  a clear 
distinction  for  the  control  aspect  of  distributed  subprocess. 
The  notion  of  common  conceptual  schema  over  three  aspects  in 
each  subprocess  makes  distribution  invisible  at  the  conceptual 
level,  and  forces  the  network  aspect  to  the  internal  level  of 
data.  This  leads  to  unnecessary  complication  of  the  internal 
specification  and  to  incompatibility  of  the  database  approach 
with  the  network  approach  (OSI/RM) , where  the  implementation 
of  the  OS I standards  locates  all  control  and  data  management 
aspects  at  the  Application  Layer. 

Especially  in  a distributed  process  environment,  the 
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Figure  A. 3 Proposed  Reference  Model  for  Distributed 
Subprocess 
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general  reference  model  should  consider  the  three  modules 
(aspects)  of  each  subprocess  with  equal  importance.  In  order 
to  remove  the  incompatibility  between  current  database  model 
and  OSI/RM  and  to  provide  the  designated  separation,  a new 
reference  model  is  designed  as  shown  Figure  A. 3.  An 
interesting  aspect  of  this  merged  diagram  is  the  fact  that 
Presentation  Layer  of  the  network  module  and  the  Data 
Independence  Access  Layer  of  the  data  module  have  been  merged 
into  a single  layer  in  the  control  module,  which  supports  data 
transformations  for:  (1)  interprocess  communications,  (2)  data 
storage  and  retrieval,  and  (3)  control  for  the  specific  task 
in  each  subprocess.  This  diagram  also  identifies  a new  layer 
(or  sublayer  within  the  Application  Layer)  which  is  concerned 
with  those  application  management  functions,  such  as 
application  process  integrity  (initiation,  checkpoint, 
recovery,  commitment  and  termination) . 
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